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Bruno et Exp. HematoL, 1 9:378.381 [1991]; and Yonemura et al., Exp. 
HematoL, 20:1011-1016 [1992]). 

Other documents of interAcst m^i.,^«. c . . 
* „ M M * ~ merest include: Eppstein et a/., U.S. Patent No. 

4.962.091: Chong. U.S. Patent No. 4,879.111; Fernandas eta,.. U.S. Paten, No. 
4 604,377; Wissler et a/.. U.S. Patent No. 4,512,971; Gott.ieb, U.S. Patent No. 
4.468,379; Bennett et a,., U.S. Patent No. 5,215,895; Kogan et a,., U.S. Patent No. 
5.250.732; K.mura et al., Eur. J. Immunol., 20(9): 1927-1931 [1990]; Secor et 
al., J. of Immunol., 144(4): 1484-1 489 [1990]; Warren et a/., J. of Immunol 
140(1):94-99 [1988]; Warren et a,., Exp. Hemato,., 1 7(1 1 ) : 1 095-1 099 
[1989]; Bruno et al.. Exp. HematoL, 17(10,:1038-1043 [1989]; Tanikawa et al 
Exp. HematoL, 17(8):883-888 [1989]; Koike et al.. Blood, 75(12) 2286-2291 
[1990]; Lotem, Blood. 75(5):1545-1551 [1989]; R enn ick et al.. Blood 
73(7):1828-1835 [1989]; and Clutterbuck et al., Blood, 73(6):1 504-1 51 2 
[1989]. 

III. Thrombocytopenia 

■ Platelets are critical elements of the blood clotting mechanism. Depletion of 
the circulating level of platelets, called thrombocytopenia, occurs in various cilnica. 
condrtions and disorders. Thrombocytopenia is commonly defined « a platelet count 
below 150 X 109 per liter. The major causes of thrombocytopenia can be broadly 
d.vided into three categories on the basis o, platelet life span, namely; (1) impaired 
production of platelets by the bone marrow. (2) platelet sequestration in the spleen 
(splenomegaly), or (3) increased destruction of platelets in the peripheral 
emulation (e.g.. autoimmune thrombocytopenia or ohemo- and radiation-therapy) 
Additionally. In patients receiving large volumes of rapidly administered platelet-poor 
blood products, thrombocytopenia may develop due to dilution. 

The clinical bleeding manifestations of thrombocytopenia depend on the severity 
of thrombocytopenia, its cause, and possible associated coagulation defects. In general 
pafents with platelet counts between 20 and 100 X 10* per liter are at risk of 
excessive post traumatic bleeding, while those with platelet counts below 20 X 10* 
per liter may bleed spontaneously. These latter patients are candidates for platelet 
transfusion with attendant immune and viral risk. For any given degree of 
thrombocytopenia, bleeding tends to be more severe when the cause is decreased 
production rather than Increased destruction of platelets. In the latter situation 
accelerated platelet turnover results in the circulation of younger, larger and 
hemostatics more effective platelets. Thrombocytopenia may result from a variety 
of disorders briefly described below. A more detailed description may be found In 
Schafner. A. I.. Thrombocytopenia and Disorders of Platelel Function." Internal 

• 6 - 
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Medicine, 3rd Ed., John J. Hutton it a/., Eds., Little Brown and Co., Boston/Toronto/ 
London [1990]. 

(a) Thrombocytopenia due to impaired platelet production 
Causes of congenital thrombocytopenia include constitutional aplastic anemia 
(Fanconi syndrome) and congenital amegakaryocytic thrombocytopenia, which may be 
associated with skeletal malformations. Acquired disorders of platelet production are 
caused by either hypoplasia of megakaryocytes or ineffective thrombopoiesis. 
Megakaryocyte hypoplasia can result from a variety of conditions. Including marrow 
aplasia (including idiopathic forms or myelosuppresslon by chemotherapeutic agents 
or radiation therapy), myelfibrosls, leukemia, and invasion ol the bone marrow by 
metastatic tumor or granulomas. In some situations, toxins, infectious agents, or 
drugs may interfere with thrombopoiesis relatively selectively; examples include 
transient thrombocytopenias caused by alcohol and certain viral infections and mild 
thrombocytopenia associated with the administration of thiazide diuretics. Finally, 
ineffective thrombopoiesis secondary to megaloblastic processes (folate or B12 
deficiency) can also cause thrombocytopenia, usually with coexisting anemia and 
leukopenia. 

Current treatment of thrombocytopenias due to decreased platelet production 
depends on identification and reversal of the underlying cause of the bone marrow 
failure. Platelet transfusions are usually reserved for patients with serious bleeding 
complications, or for coverage during surgical procedures, since isoimmunization may 
lead to refractoriness to further platelet transfusions. Mucosa) bleeding resulting 
from severe thrombocytopenia may be ameliorated by the oral or intravenous 
administration of the antifibrinolytic agents. Thrombotic complications may develop, 
however, if antifibrinolytic agents are used in patients with disseminated 
intravascular coagulation (DIC). 

(b) Thrombocytopenia due to splenic sequestration 

Splenomegaly due to any cause may be associated with mild to moderate 
thrombocytopenia. This is a largely passive process (Hypersplenism) of splenic 
platelet sequestration, In contrast to the active destruction of platelets by the spleen in 
eases of immunomedlated thrombocytopenia discussed below. Although the most 
common cause of Hypersplenism is congestive splenomegaly from portal hypertension 
due to alcoholic cirrhosis, other forms of congestive, infiltrative, or 
lymphoprotiferatrve splenomegaly are also associated with thrombocytopenia. Platelet 
3 5 counts generally do not fall below 50 X 1q9 per Uter as a result of Hypersplenism 
alone. 
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(c) Thrombocytopenia due to nonimmune-mediated platelet destruction 
Thrombocytopenia can result from the accelerated destruction of platelets by 
various nonimmunologic processes. Disorders of this type include disseminated 
'intravascular coagulation, prosthetic intravascular devices, extra corporeal 
circulation of the blood, and thrombotic microangiopathies such as thrombotic 
thrombocyte purpura. In all of these situations, circulating platelets that are exposed 
to e.ther artificial surfaces or abnormal vascular intima either are consumed at these 
sites or are damaged and then prematurely cleared by the reticuloendothelial system. 
Disease states or disorders in which disseminated intravascular coagulation (DIC) may 
arise are set forth in greater detail in Braunwald et al. (eds), Harrison's Principles of 
Internal Medicine, 11th Ed., p. 1478. McGraw Hill [1987]. Intravascular prosthetic 
devices, including cardiac valves and intra-aortic balloons can cause a mild to 
moderate destructive thrombocytopenia and transient thrombocytopenia in patients 
undergoing cardiopulmonary bypass or hemodialysis may result from consumption or 
1 5 damage of platelets In the extra corporeal circuit. 

(d) Drug-induced immune thrombocytopenia 

More than 100 drugs have been implicated In Immunologically mediated 
thrombocytopenia. However, only quinidine. quinine, gold, sulfonamides, cephalothin 
and heparin have been well characterized. Drug-induced thrombocytopenia is 
frequently very severe and typically occurs precipitously within days while patients 
are taking the sensitizing medication. 

(e) Immune (autoimmune) thrombocytopenic purpura (ITP) 
ITP in adults is a chronic disease characterized by autoimmune platelet 

destruction. The autoantibody is usually IgG although other immunoglobulins have also 
been reported. Although the autoantibody of ITP has been found to be associated with 
platelet membrane GPII b llla. the platelet antigen specificity has not been identified in 
most cases. Extravascular destruction of sensitized platelets occurs in the 
reticuloendothelial system of the spleen and liver. Although over one-haJf of all cases 
of ITP are idiopathic, many patients have underlying rheumatic or autoimmune 
diseases (e.g., systemic lupus erythematosus) or lymphoprollferative disorders (e.g., 
chronic lymphocytic leukemia), 
(f) HIV-lnduced ITP 

ITP Is an increasingly common complication of HIV infection (Morris et al 
Ann. Intern. Med.. 96:714-717 [1982]). and can occur at any stage of the disease 
progression, both in patients diagnosed with the Acquired Immune Deficiency 
Syndrome (AIDS), those with AIDS-related complex, and those with HIV Infection but 
without AIDS symptoms. HIV infection is a transmissible disease ultimately 
characterized by a profound deficiency of cellular immune function as well as the 
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occurrence of opportunistic infection and malignancy. The primary immunologic 
abnormality resulting from Infection by HIV is the progressive depletion and 
functional impairment of T lymphocytes expressing the CD4 cell surface glycoprotein 
(Lane et al.. Ann. Rev. Immunol., 3:477 [1985]). The loss of CD4 helper/induoer T 
5 cell function probably underlies the profound defects in cellular and humoral 
immunity leading to the opportunistic infections and malignancies characteristic of 
AIDS (H. Lane supra). 

Although the mechanism of HlV-associated ITP Is unknown, it is believed to be 
different from the mechanism of ITP not associated with HIV infection. (Walsh et al.. 
10 N. Eng. J. Med., 311:635-639 [1984]; and Ratner, Am. J. Med., 86:194-198 
[1989]). 



1 5 



IV. Current Therapy for Thrombocytopenia 

The therapeutic approach to the treatment of patients with thrombocytopenia is 
dictated by the severity and urgency of the clinical situation. The treatment is similar 
for HlV-associated and non-HIV-related thrombocytopenia, and although a number of 
different therapeutic approaches have been used, the therapy remains controversial. 

Platelet counts in patients diagnosed with thrombocytopenia have been 
successfully increased by glucocorticoid (e.g., prednisolone) therapy, however in most 
20 patients, the response is incomplete, or relapse occurs when the glucocorticoid doss is 
reduced or its administration Is discontinued. Based upon studies with patients having 
HlV-associated ITP, some investigators have suggested that glucocorticoid therapy may 
result in predisposition to AIDS. Glucocorticoids are usually administered if platelet 
count falls below 20 X 10 9 /liter or when spontaneous bleeding occurs. 

2 5 For patients refractory to glucocorticoids, the compound: 

4-(2-chlorphenyl)-9-methyl-2-[3-(4-morpholinyl)-3-propanon-1-yl]6H- 
thieno[3,2,f][1,2,4]triazolo[4.3,a,][1,4]diazepin (WEB 2086) 
has been successfully used to treat a severe case of non HlV-associated ITP. A patient 
having platelet counts of 37,000-58.000/nl was treated with WEB 2086 and after 

3 0 1-2 weeks treatment platelet counts increased to 1 40.000-1 90,000/(il. (EP 

361,077 and Lohman et at.. Lancet , 1 147 [1988]). 

Although the optimal treatment for acquired amegakaryocytic thrombocytopenia 
purpura (AATP) Is uncertain, antithymocyte globulin (ATG), a horse antiserum to 
human thymus tissue, has been shown to produce prolonged complete remission 
3 5 (Trimble et al., Am. J. Hematol., 37:126-127 [1991]). A recent report however, 
indicates that the hematopoietic effects of ATG are attributable to thimerosal. where 
presumably the protein acts as a mercury carrier (Panella ef al.. Cancer Research, 
50:4429-4435 [1990]). 
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Good results have been reported with splenectomy. Splenectomy removes the 
major site of platelet destruction and a major source of autoantibody production in 
many patients. This procedure results in prolonged treatment-free remissions in a 
large number of patients. However, since surgical procedures are generally to be 
5 avoided in immune compromised patients, splenectomy Is recommended only in severe 
cases of thrombocytopenia (e.g. severe HIV-associated ITP), in patients who fail to 
respond to 2 to 3 weeks of glucocorticoid treatment, or do not achieve sustained 
response after discontinuation of glucocorticoid administration. Based upon current 
scientific knowledge, it is unclear whether splenectomy predisposes patients to AIDS. 
1 0 I" addition to prednisolone therapy and splenectomy, certain cytotoxic agents, 

e.g., vincristine, and azidothimidine (AZT, zidovudine) also show promise In treating 
HIV-induced ITP; however, the results are preliminary. 

It will be appreciated from the foregoing that one way to treat 
thrombocytopenia would be to obtain an agent capable of accelerating the 

1 5 differentiation and maturation of megakaryocytes or precursors thereof into the 

platelet-producing form. Considerable efforts have been expended on identifying such 
an agent, commonly referred to as "thrombopoietin" (TPO). Other names for TPO 
commonly found In the literature include; thrombocytopoiesis stimulating factor 
(TSF), megakaryocyte colony-stimulating factor (MK-CSF), megakaryocyte- 

2 0 stimulating factor and megakaryocyte potentiator. TPO activity was observed as early 

as 1959 (Rak et ai, Med. Exp., 1:125) and attempts to characterize and purify this 
agent have continued to the present day. While reports of partial purification of TPO- 
active polypeptides exist (see, for example, Tayrien et a/., J. Biol. Chem., 262:3262 
[1987] and Hoffman era/., J. Clin. Invest 75:1174 [1985]), others have postulated 

2 5 that TPO is not a discrete entity in its own right but rather is simply the 

polyfunctional manifestation of a known hormone (!L- 3, Sparrow et al, Prog. Clin. 
Biol. Res., 215:123 [1986]). Regardless of its form or origin, a molecule possessing 
thrombopoietic activity would be of significant therapeutic value. Although no protein 
has been unambiguously identified as TPO, considerable interest surrounds the recent 

3 0 discovery that mpl, a putative cytokine receptor, may transduce a thrombopoietic 

signal. 

V. Mpl is a Megakaryocytopoletic Cytokine Receptor 

It is believed that the proliferation and maturation of hematopoietic cells is 
3 5 tightly regulated by factors that positively or negatively modulate pluripotential stem 
cell proliferation and multilineage differentiation. These effects are mediated through 
the high-affinity binding of extracellular protein factors to specific cell surface 
receptors. These cell surface receptors share considerable homology and are generally 
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classified as members of the cvtokino , 0 .«„i. 

u ., . cyioxine recepfor superfamily. Members of the 

superfamily include receptors for- IL-? tn an w • . 

3dd -»i n Q o 0 , , 2 (0 and y chams, (Hatakeyama et a,.. Science, 

244.551-556 [1989]; Takeshita et a,., Science. 257:379-382 [1991]) ,L-3 
(Itoh et al., Science, 247:324-328 n ggni- r nm . . 0 

3 7' 5 ; 59 p 54 [1990): ^ ^.Ce,,, 66,165-1174 [1991a,; Kitamura 
eta . Proc. Nat,. Acad. Sci. USA. 88:5082-5086 [1991b,). ,L-4 (Mosley Ja 
Ce,, 59:335-348 [1989,. ,L-5 (Takak, et a,.. EMBO ,.9:4367-4374 [1990 r 
IZZ'Jl T Ce/, ' 66:1175 " 1184 IL-6 (Vamasak, et a,. Science. 

» , ^ IT" H ' b ' " * Ca// '" :1149 - 11 ^ I"**. -L-7 (Goodwin 

ef a/., Ce//, 60:941-951 (1990,). .L-9 (Renault ef a/., P roc . /*,„. „ ca£ , Sc/ USA 

89:5690-5694 [1992,). granulocyte-macrophage colony-stimulating factor (GM- 
CSF) (Gearing eta,., EMBO J., 8:3667-3676 [1991]; Hayashida eta,.. Proc Nat, 
TrJ CL 244:9655 " 9659 ' 1990 »- 9-nu.ocyte co.ony-stimu.at.ng factor 
(G-CSF) (Fukunaga eta,.. Ce//. 61:341-350 [1990a,; Fukunaga eta,.. Proc. Nat, 
Acad. SC. (^,87:8702-8706 [1990b]; Larsen eta,.. J. Exp. Met, 1721559- 
1570 [1990,). EPO (D'Andrea eta,., Ce,, 57:277-285 [1989]; Jones 
76:31-35 [1990]). Leukemia inhibitory factor (LIF) (Gearing eta,.. EMBO J 
0:2839-2848 [1991,,. oncostat.n M (OSM, (Rose et a,., Proc. Nat,. Acad. Sci 
^4,88:8641-8645 [1991]) and also receptors for prolactin (Boutin eta,., Proc 
Nat,. Acad. SC. USA, 88:7744-7748 [1988]; Edery ef a/.. Proc. Nat,. Acad. SC. USA 
86:2112-2116 [1989]). growth hormone (GH, (Leung eta,., Nature, 330 537- 

Isa-sres" [l991] Ci,,ary neUr0,r ° PhiC ,aCt ° r (CNTF) (Davis et *'■> Science, 

Members of the cytokine receptor superfamily may be grouped into three 
funct,onal categories (for review see Nicola et a,., Ce„, 67:1-4 [1991]) The first 
class comprises single chain receptors, such as erythropoietin receptor (EPO-R) or 
granulocyte co.ony st.mu.at.ng factor receptor (G-CSF-R). which bind Hgand with 
high affinity via the extrace.,u.ar domain and also generate an intrace.iu.ar signa. A 
second class of receptor,, so ca..ed «-subunits. inCudes in.er.euk.n-6 receptor 
IL6-R), granulocyte-macrophage colony stimulating factor receptor (GM-CSF-R) 
.nter|euk.„-3 receptor (.L3-R«) and other members of the cytokine receptor 
supply. These a-subunits bind Hgand with ,ow affinity but cannot transduce an 
■ntracenuter signa.. A high affinity receptor ca P ab.e of signaiing is generated by a 
heterodhner between an a-subunit and a member of a third Cass of cytokine 
receptors, termed P -subun,ts. e. g .,fi c , the common p-, ub un.t for the tie !! 
subunits IL3-Ra and GM-CSF-R. 

Evidence that mp, is a member of the cytokine receptor superfamMy comes 
from sequence homoiogy (Gearing. EMBO ,,8:3667-3676 [1988]; Bawn, Proc. 
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Nat,. Aca*. Sd. USA, 87:6834-6938 [1990]; Davis eta,., Science, 253:59-63 
[1991] and Vigon et a,., Proc. Nat,. AcaO. Sci. USA, 89:5640-5644 [1992]) and its 
ability to transduce proliferative signals. 

Deduced protein sequence from molecular cloning of murine c-mp/ reveals this 
protein is homologous to other cytokine receptors. The extracellular domain contains 
465 amino acid residues and is composed of two subdomains each with four highly 
conserved cysteines and a particular motif in the N-terminal subdomain and in the C- 
terminal subdomain. The ligand-binding extracellular domains are predicted to have 
similar double (J-barrel fold structural geometries. This duplicated extracellular 
domain is highly homologous to the signal transducing chain common to IL-3, IL-5 and 
GM-CSF receptors as well as the low-affinity binding domain of LIF (Vigon et at 
Oncogene, 8:2607-2615 [1993]). Thus mpl may belong to the low affinity ligand 
binding class of cytokine receptors. 

A comparison of murine mpl and mature human mpl P. reveals these two 
proteins show 81% sequence identity. More specifically, the N-terminus and C- 
terminus extracellular subdomains share 75% and 80% sequence identity 
respectively. The most conserved mpl region is the cytoplasmic domain showing 91% 
amino acid identity, with a sequence of 37 residues near the transmembrane domain 
being identical in both species. Accordingly, mpl is reported to be one of the most 
conserved members of the cytokine receptor superfamify (Vigon supra). 

Evidence that mpl is a functional receptor capable of transducing a 
proliferative signal comes from construction of chimeric receptors containing an 
extracellular domain from a cytokine receptor having high affinity for a known 
cytokine with the mpl cytoplasmic domain. Since no known ligand for mpl has been 
reported, it was necessary to construct the chimeric high affinity ligand binding 
extracellular domain from a class one cytokine receptor such as IL-4R or G-CSFR. 
Vigon et al., supra fused the extracellular domain of G-CSFR with both the 
transmembrane and cytoplasmic domain of c-mpl. An IL-3 dependent cell line 
BAF/B03 (Ba/F3) was transfected with the G-CSFR/mp/ chimera along with a full 
length G-CSFR control. Ceils transfected with the chimera grew equally well In the 
presence of cytokine IL-3 or G-CSF. Similarly, cells transfected with G-CSFR also 
grew well in either IL-3 or G-CSF. All cells died in the absence of growth factors. A 
similar experiment was conducted by Skoda et al., EMBO J., 12(7):2645-2653 
[1993] in which both the extracellular and transmembrane domains of human IL-4 
receptor (hlL-4-R) were fused to the murine mpl cytoplasmic domain, and 
transfected into a murine IL-3 dependent Ba/F3 cell line. Ba/F3 cells transfected 
with wild type hlL-4-R proliferated normally In the presence of either of the species 
specific IL-4 or IL-3. Ba/F3 cells transfected with hlL-4R/mp/ proliferated 
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normally in the presence of hlL-4 (in the n rfi c«n^ 

that in D fl /n „ «. Presence or absence of IL-3) demonstrating 

that in Ba/F3 cells the mpl cytoolasmir Hnmoin „ * . 

transduce a pro.iferative signa, " a " ^ 6,emen,S * 

of .h CWmen ' C CXPerimentS dem ° nS,ra,e ,he P ro,i,erat,on -fl~"nO capability 

la T d ° main " Si,6nt "»"*" - - r ace„ U ,a! 

2 T , 3 ' i9and - ^ C ° nSiS,ent Wi,h * — *o possibilities, 

namely. mp, ls a single chajn (c)ass one) ^ ^ ^ ^ ^ 

s.gna, transducing p-subunlt (class three) requj , ng an ^ 
et al. supra). 

VI. Mpl Ligand is a Thrombopoletin (TPO) 

As described above. It has been suggested that serum contains a unique factor 
sometimes referred to as thrombopoietin (TPO). that acts synergistic^ with various 
other cytokines to promote growth and maturation of megakaryocytes. No such natural 
factor has ever been isolated from serum or any other source even though considerable 
effort has been expended by numerous groups. Even though i, is no, known whether 
m P , ,s ca P ab.e of directly binding a megakaryocyte stimulating factor, recent 
expenments demonstrate that mp, ,s involved in pro.iferative signal transduction 

2 0 Ihia T^TT ^ *° Semm °' PatiemS aplaSt ' C b0ne — 

20 (Methia et a/.. Blood, 8.2 (5): 1395- 1401 [1993]). 

Evidence that a unique semm colony-forming factor distinct from IL-fa. IL- 

1 u h 1 IL '! 1 ' SCF ' EPO> G '° SF ' ^ GM - CSF ,ra " SduC * S a P"^-*. 
s.gnal through mpl comes from examination of the distribution of cmp, expression «n 

2 b r h ' ,,Ve an i d 1 COmmitted he ™W - -ines and from mp, antisense studies In one 

2 5 of these cell lines. 

Using reverse transcriptase (RT)-PCR in immune-purified human 
hematoocetic celts. Methia *, a,, supra demonstrated that strong mp i mRNA messages 
were only found in CD34+ purified cells, megakaryocytes and platelets. CD34+ cells 
purrf-ed from bone marrow (BM) represents about 1% of a«. BM ce..s and are 

3 0 enriched In primitive and committed progenitors of all lineages (e.g.. erythroid 

granulomacrophage, and megakaryocytic). 

Mpl antisense oligodeoxynucleotides were shown to suppress megakaryocytic 
colony , ormalion from tne p|urjpotent CD34+ ^ ^ ^ ^ 

r^rT"" ^ ^ S ° UrCe °' maaate ^ ccony-stimulating actlvKy 
[MK-CSA]). These same antisense Cigodeoxynucleotides had no effect on erythroid or 
granulomacrophage colony formation. 

Whether mpl directly bound a ligand and whether the serum factor shown to 
cause meoakaryocytopo.esis acted through mp, was stll. unknown. „ had been 
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suggested, however, that if mp/ did directly bind a ligand, Its amino acid sequence was 
likely to be highly conserved and have species cross-reactivity owing to the 
considerable sequence identity between human and murine mpi extracellular domains 
(Vigon et at,, supra [1993]). 

5 

VII. Objects 

In view of the foregoing, it will be appreciated there is a current and 
continuing need in the art to isolate and identify molecules capable of stimulating 
proliferation, differentiation and maturation of hematopoietic cells, especially 
10 megakaryocytes or their predecessors for therapeutic use in the treatment of 
thrombocytopenia. It is believed such a molecule is a mpt ligand and thus there exists 
a further need to isolate such ligand(s) to evaluate their role(s) in cell growth and 
differentiation. 

Accordingly, it is an object of this invention to obtain a pharmaceutically pure 

1 5 molecule capable of stimulating proliferation, differentiation and/or maturation of 

megakaryocytes into the mature platelet-producing form. 

it is another object to provide the molecule in a form for therapeutic use in the 
treatment of a hematopoietic disorder, especially thrombocytopenia. 

It Is a further object of the present invention to isolate, purify and specifically 

2 0 identify protein ligands capable of binding in vivo a cytokine superfamily receptor 

known as mpi and to transduce a proliferative signal. 

It is still another object to provide nucleic acid molecules encoding such 
protein ligands and to use these nucleic acid molecules to produce mpi binding ligands 
in recombinant cell culture for diagnostic and therapeutic use. 

2 5 It is yet another object to provide derivatives and modified forms of the protein 

ligands including amino acid sequence variants, variant glycoprotein forms and 
covalent derivatives thereof. 

It is an additional object to provide fusion polypeptide forms combining a mpi 
ligand and a heterologous protein and covalent derivatives thereof. 

3 0 It is still an additional object to provide variant polypeptide forms combining a 

mpi ligand with amino acid additions and substitutions from the EPO sequence to 
produce a protein capable of regulating proliferation and growth of both platelets and 
red blood cell progenitors. 

It is yet an additional object to prepare immunogens for raising antibodies 
3 5 against mpi ligands or fusion forms thereof, as well as to obtain antibodies capable of 
binding such ligands. 

These and other objects of the invention will be apparent to the ordinary 
artisan upon consideration of the specification as a whole. 
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SUMMARY OF THE INVENTION 

The objects of the invention are achieved by providing an isolated mammalian 
megakaryocytopoietic proliferation and maturation promoting protein, denominated 
the 'mpl ligand" (ML) or "thrombopoietin" (TPO), capable of stimulating 
proliferation, maturation and/or differentiation of megakaryocytes into the mature 
platelet-producing form. 

This substantially homogeneous protein may be purified from a natural source 
by a method comprising; (1) contacting a source plasma containing the mpl ligand 
molecules to be purified with an immobilized receptor polypeptide, specifically mpl or 
a mpl fusion polypeptide immobilized on a support, under conditions whereby the mpl 
ligand molecules to be purified are selectively adsorbed onto the immobilized receptor 
polypeptide. (2) washing the immobilized receptor polypeptide and its support to 
remove non-adsorbed material, and (3) eluting the mpl ligand molecules from the 

1 5 immobilized receptor polypeptide to which they are adsorbed with an elution buffer. 

Preferably the natural source is mammalian plasma or urine containing the mpl 
ligand. Optionally the mammal is aplastic and the immobilized receptor is a mp/-lgG 
fusion. 

Optionally, the prefered megakaryocytopoietic proliferation and maturation 

2 0 promoting protein is an isolated substantially homogeneous mpl ligand polypeptide 

made by synthetic or recombinant means. 

The "mpl ligand" polypeptide or "TPO" of this invention preferably has at least 
70% overall sequence identity with the amino acid sequence of the highly purified 
substantially homogeneous porcine mpl ligand polypeptide and at least 80% sequence 
identity with the "EPO-domain" of the porcine mpl ligand polypeptide. Optionally, the 
mpl ligand of this invention is mature human mpl ligand (hML). having the mature 
amino acid sequence provided in Fig. 1 (SEQ ID NO: 1), or a variant or 
posttranscrlptionally modified form thereof or a protein having about 80% sequence 
identity with mature human mpl ligand. Optionally the mpl ligand variant is a 
30 fragment, especially an amino-terminus or 'EPO-domain" fragment, of the mature 
human mpl ligand (hML). Preferably the amino terminus fragment retains 
substantially all of the human ML sequence between the first and forth cysteine 
residues but may contain substantial additions, deletions or substitutions outside that 
region. According to this embodiment, the fragment polypeptide may be represented by 
35 the formula: 

X-hML(7-151)-Y 

Where hML(7-151) represents the human TPO (hML) amino acid sequence from Cys? 
through CyslSi inclusive; X represents the amino group of Cys? or one or more of the 
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amino-terminus amino acid residue(s) of the mature hML or amino acid residue 
extensions thereto such as Met, Tyr or leader sequences containing, for example, 
proteolytic cleavage sites (e.g. Factor Xa or thrombin); and Y represents the carboxy 
terminal group of Cys^l 0 r one or more carboxy-terminus amino acid residue(s) of 
5 the mature hML or extensions thereto. 

Optionally the mpl ligand polypeptide or fragment thereof may be fused to a 
heterologous polypeptide (chimera). A preferred heterologous polypeptide is a 
cytokine, colony stimulating factor or interleukin or fragment thereof, especially kit- 
ligand (KL), IL-1, IL-3, IL-6, IL-11, EPO, GM-CSF or LIF. An optional preferred 
10 heterologous polypeptide is an immunoglobin chain, especially human lgG1, lgG2 f 
lgG3, lgG4 ( IgA, IgE, IgD, IgM or fragment thereof, especially comprising the constant 
domain of an IgG heavy chain. 

Another aspect of this invention provides a composition comprising an isolated 
mpl agonist that is biologically active and is preferably capable of stimulating the 

1 5 incorporation of labeled nucleotides (e.g., 3H-thymidine) into the DNA of IL-3 

dependent Ba/F3 cells transfected with human mpl. Optionally the mpl agonist is 
biologically active mpl ligand and is preferably capable of stimulating the 
incorporation of 3 5s into circulating platelets in a mouse platelet rebound assay. 
Suitable mpl agonist include hML 153 , hML(R153A, R154A), hML2, hML3, hML4, 

2 0 mML, mML2, mML3, pML, and pML2 or fragments thereof. 

In another embodiment, this invention provides an isolated antibody capable of 
binding to the mpl ligand. The isolated antibody capable of binding to the mpl ligand 
may optionally be fused to a second polypeptide and the antibody or fusion thereof may 
be used to isolate and purify mpl ligand from a source as described above for 

2 5 immobilized mpl. In a further aspect of this embodiment, the invention provides a 

method for detecting the mpl iigand In vitro or In vivo comprising contacting the 
antibody with a sample, especially a serum sample, suspected of containing the iigand 
and detecting if binding has occurred. 

In still further embodiments, the invention provides an isolated nucleic acid 

3 0 molecule, encoding the mpl ligand or fragments thereof, which nucleic acid molecule 

may optionally be labeled with a detectable moiety, and a nucleic acid molecule having a 
sequence that is complementary to, or hybridizes under moderate to highly stringent 
conditions with, a nucleic acid molecule having a sequence encoding a mpl ligand. 
Preferred nucleic acid molecules are those encoding human, porcine, and murine mpl 
35 ligand, and include RNA and DNA, both genomic and cDNA. In a further aspect of this 
embodiment, the nucleic acid molecule is DNA encoding the mpl ligand and further 
comprises a replicable vector in which the DNA is operably linked to control sequences 
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recognized by a host transformed with the vector. Optionally the DNA is cDNA having 
the sequence provided in Fig. , 5 '.3' (SEQ ID NO: 2), 3'-5- or a fragment thereof. 
This aspect further includes host cells, preferably CHO cells, transformed with the 
vector and a method of using the DNA to effect production of mpl ligand, preferably 
comprising expressing the cDNA encoding the mpl ligand in a culture of the 
transformed host cells and recovering the mpl ligand from the host cells or the host 
cell culture. The mpl ligand prepared in this manner is preferably human mpl ligand. 

The invention further includes a method for treating a mammal having a 
hematopoietic disorder, especially thrombocytopenia, comprising administering a 
therapeutically effective amount of a mpl ligand to the mammal. Optionally the mpl 
ligand is administered in combination with a cytokine, especially a colony stimulating 
factor or interleukin. Preferred colony stimulating factors or interleukins include; 
kit-ligand <KL), LIF, G-CSF, GM-CSF. M-CSF, EPO. IL-1, IL-3. | L -6. and IL-11. 

The invention further includes a process for isolating and purifying TPO (ML) 

1 5 from a TPO producing microorganism comprising: 

( 1 ) disrupting or lysing cells containing TPO, 

(2) optionally seperating soluble material from insoluble material 
containing TPO, 

( 3 ) solublizing TPO in the insoluble material with a solublizing buffer, 

2 0 ( 4 ) seperating solublized TPO from other soluble and insoluble material, 

( 5 ) refolding TPO In a redox buffer, and 
( 6 ) separating properly folded TPO from misfolded TPO. 
The process provides for solubillzing the insoluble material containing TPO 
with a chaotropic agent where the chaotropic agent is selected from a salt of guanidine, 
sodium thiocyanate, or urea. The process further provides that solublized TPO Is 
seperated from other soluble and insoluble material by one or more steps selected 
from centrafugation, gel filtration and reverse phase chromotography. The refolding 
step of the process provides for a redox buffer containing both an oxidizing and 
reducing agent. Generally, the oxidizing agent is oxygen or a compound containing at 
least one disulfide bond and the reducing agent is a compound containing at least one 
free sulfhydryl. Preferably, the oxidizing agent la selected from oxidized 
glutathlone(GSSG) and cystine and the reducing agent is selected from reduced 
glutathlone(GSH) and cysteine. Most preferably the oxidizing agent is oxidized 
glutathione(GSSG) and the reducing agent is reduced glutathione(GSH). It Is also 
prefered that the molar ratio of the oxidizing agent is equal to or greater then that of 
the reducing agent. The redox buffer additionally contains a detergent, preferably 
selected from CHAPS and CHAPSO. present at a level of at teast1%. The redox buffer 
additionally contains NaCI preferably at a concentration range of about 0.1-0.5M, and 
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glycerol preferably a. a concentration greater than 15%. The P H of the redox buffer 
preferably ranges from about P H 7.5- P H 9.0. and the refolding step is conducted at 4 
degrees for 12-48hr. The refolding step produces biologically active TPO in which a 

disulfide bond is formed between the Cv« n «r fl ct *k„ . 

1 ine uvs nearest the amino-terminus with the Cys 
nearest the carboxy-terminus of the EPO domain. 

The invention further includes a process for purifying biologically active TPO 
from a microorganism comprising: 

( 1 ) lysing at least the extracellular membrane of the 
microorganism, 

( 2 ) treating the lysate containing TPO with a chaotropic agent, 
( 3 ) refolding the TPO, and 

( 4 ) separating impurities and misfolded TPO from properly folded 

TPO. 
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BRIEF DESCRIPTION OF THE FIGURES 
Fig. 1 shows the deduced amino acid sequence (SEQ ID NO: 1) of human mpl ligand 
(hML) cDNA and the coding nucleotide sequence (SEQ ID NO: 2). Nucleotides are 
numbered at the beginning of each line. The 5* and 3' untranslated regions are 
indicated in lower case letters. Amino acid residues are numbered above the sequence 
starting at Ser 1 of the mature mpl ligand (ML) protein sequence. The boundries of 
presumed exon 3 are indicated by the arrows and the potential N-glycosylation sites 
are boxed. Cysteine residues are indicated by a dot above the sequence. The underlined 
sequence corresponds to the N-terminal sequence determined from mpi ligand purified 
from porcine plasma. 

Fig. 2 shows the procedure used for the mpl ligand 3 H - t hymidine incorporation 
assay. To determine the presence of mpl ligand from various sources, the mpl P Ba/F3 
cells were starved of IL-3 for 24 hours in a humidified incubator at arc In 5% C0 2 
and air. Following IL-3 starvation the cells were plated out in 96 well culture dishes 
w.th or without diluted samples and cultured for 24 hrs in a cell culture incubator 
20 uJ of serum free RPMI media containing 1 uC i of 3 H -thymidine was added to each 
well for the last 6* hours. The cells were then harvested on 96 well filter plates and 
washed with water. The filters were then counted. 

Fig. 3 shows the effect of pronase, DTT and heat on the ability of APP to stimulate 
Ba/F3-mp/ cell proliferation. For pronase digestion of APP, pronase (Boehringer 
Mannheim) or bovine serum albumin was coupled to Affl-gellO (Biorad) and 
mcubated individually with APP for 18hrs. at 37X. Subsequently, the resins were 
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removed by centrifugation and supematants assavsd add 

4 • -e wm DTT tojlowed by dia ~ P ; B r als ° to «« ** 

s s? P ;;r:;:r;.:r ,i9and ^ 
we, „ Peak uom ,he mpi ^ — 

ii-on m r r s ' PAGE °' eiuted u,,raiink ^' ,racti ° ns - t ° 200 - - — 

0 IZ ^L SCe,0ne C ° n,ainin9 HCI * " 2 ° BC W3S A «- 3hrs. a. 

20 C sampies were centrifuged and resu.tant pei.ets were washed 2x with acetone at 

blr l l IT Pe " e,S SUbSeqUent,/ d,SS0,Ved 30 »" ° f SDS-so«ubi ta a«ion 
buffer, made 100 uM DTT and heated at 90-C for 5 mln. The samp.es were then 
resold on a 4- 20% SDS-po.vacry.amide 9 e, and prote. j££ £ 

F.g 6 shows e.ution o, mpl , iga nd activity from SDS-PAGE. Fraction 6 from the 
n^m* co, umn was reso|ved Qn a 4 . 2Q% 

e d uc, ng con ,„ons. Plowing e.ectrophores. the ge, was siiced .to 1 equa, regZ 
an e. ctroeiuted as described in the examp.es. The eiectroe.uted sTmp.es we" 
d ,, y2 ed ,nto PBS and assayed a, a 1,20 di.ut.on. The Mr standards used to c ,Z 
the gel were Novex Mark 12 standards. calibrate 

Fig. 7 shows the effect of mpl liqand deleted apd u 

a ae P'e«e<j APP on human megakaryocvtODoiesis 

to « »«. At a., o. 2 M 4. ^pHga (0.5 „ anp-^jo ,0.5 m) m adaM 
ng. . «M. b* mw. 01 . 3» bp .„ grn „ o, humm D|w 

^ Th. MUM «*, Md „ y 



-19. 



WO 95/18858 



PC17US94/14553 



10 



Fig. 10 shows deduced amino acid sequence of m „w« ». 

ID NO- 6) ann ™ . u SB <wenco of mature human mpl ligand (hML) (SEQ 

ID NO. 6) and mature human erythropoietin (hEPO) (SEQ ID NO 7^ tk J 

H o t " 3 br ° k8n ,0f hEP °- ^ «"» important for 

erythropoietin activity are Indicated by a large dot. 

(seq 1 acid sequence ° f ^ ^ ^ **™ ^ 

of 1 } ' ( E ° ' D 8,1 hML3 (SE ° ,D N0: " ™" (SEQ ID NO- 

iiLet:::;:: ° acids are boxed and 9aps ~ - — « 



Figs 1 2A) 12B and 12C show the e((ec , Qf humgn 
15 pron era on (A), in vitro human megaKaryocytopoiesis Stated using 

d.olabeied murine , 9 G monoclona. antibody specific to the megakaryocyte 

" ,a (B) ' " murine thrombopoies,s measured in a 

Two hundred ninety-three ce.is were transfected by the CaP0 4 method 
20 (Gorman, C in DMA Cloning : A New Approach 2-143-igo rifl«™ w»„ o„ c 

^ o, * PRK , MLl53 ov. m , snt zzzzrzsz 

Introduce a slop codon alt., residua ,S3 ol hML by PCR). Madia was Ih.n 

25 Maoatoyocytopoiasls was ouanlltatad „,,„, , 125, radtoi.b.,,,,, „ , ' 

a, ds «.d (G ,.„, „ 8 , 00<1 „ : ,3 34 . 1 3 39 J » 

P-»d reeonfcina* „ L (rML) „ „ ^ pW#|M ^ 

30 M „4:, M6 ., M , 2 ,„„,. Partially p«ad ,„,. „.„„,.„ £ 

nf^ u(w ' wi * PBS containing 2M sacn of uraa and NaQ. Th. a*, to*,, 
was dlalyzed irrto PBS and mada 1mg/ml wilh tndoloxin fres BSA Th. ^1 

^ rjr - 01 enaMo * m * - *- - ^ 

TJTt? P t,. 

2 tailed T-test comparing medians. 
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2 17 hUman mP ' " 9and IS0 ' 0miS and f- Ba/F3- 

A ' aSSay - m0Ck ' hML2 ' hML3 ' hML <™3A. R154A), and 

hML 153 W6 re assayed at various dilutions as described In Exampie 1. 

Figs. 14A, 14B and 14C show the deduced amino acid sequence (SEQ ID NO- 1) of 
human mp, ligand (hML) or human TPO (hTPO) and the human genomic DNA coding 
sequence (SEQ ,D NO: 11). Nucleotides and amino acid rescues are numbered at the 
beginning of each line. 

Fig. 15 shows a SDS-PAGE of purified 293.rhML 332 and purified 293-rhML 153 . 

Fig. 16 shows the nucleotide sequence: cDNA coding (SEQ ID NO: 12) and deduced 
amino acid sequence (SEQ ID NO: 13) of the open reading frame of a murine ML 
soform. This mature murine mp, ligand isoform contains 331 amino acid residues 
four fewer than the putative full length mML, and is therefore designated mML2. 
Nuc.eot.des are numbered at the beginning of each iine. Amino acid residues are 
numbered above the sequence starting with Ser 1. The potentiai N-glycosylation sites 
are underlined. Cysteine residues are indicated by a dot above the sequence. 

Fig. 17 shows the cDNA sequence (SEQ ID NO: 14) and predicted protein sequence 
(SEQ ID NO: 15) of this murine ML isoform (mML). Nucleotides are numbered at the 
beg.nn.ng of each line. Amino acid residues are numbered above the sequence starting 
wrth Ser 1. This mature murine mp, iigand isoform contains 335 amino acid residues 
and is believed to be the ful. .ength mp, ..gand. designated mML. The signal sequence * 
indicated with a dashed underline and the likeiy cleavage point Is denoted with an 
arrow. The 5' and 3' untransiated regions are indicated with lower case letters The 
two delations found as a resuit of alternative spiicing (mML2 and mML3) are 
underfed. The four cysteine residues are indicated by a dot. The seven potential N- 
3 0 glycosylate sites are boxed. 
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2nl ZT 9 ' ** d6dUCed ami "° ^ Se<,UenCe °' ** huma " ML ■"*"» hML3 
(SEQ ID NO: 9) and a murine ML Isoform designated mML3 (SEQ ID NO- 16) The 

predicted amino add sequence for the human mp, iigand Is aiigned with the murine mp, 

«.gand sequence. Identicai amino acids are boxed and gaps introduced for optima. 

ailment are Seated by dashes. Amino acids are numbered at the begging of each 

line. 
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Fig. 19 compares the predicted amino acid sequences of mature ML isoforms from 
mouse-ML (SEQ ID NO: 17). porcine-ML (SEQ .0 NO: ,8) and human-ML (SEQ .0 NO: 
6). Am.no acid sequences are aligned with gaps, indicated by dashes, introduced for 
ophmal almgment. Amino acids are numbered at the beginning of each line with 
Identical residues boxed. Potentiai N-glycosylation sites are indicated by a shaded box 
and cysteine residues are designated with a dot. The conserved di-baslc amino acid 
motif that presents a potential protease cleavage site is underlined. The four amino 
acid deletion found to occur in all three species (ML2) is outlined with a bold box. 

Fig. 20 shows the cDNA sequence (SEQ ID NO: 19) and predicted mature protein 
sequence (SEQ ID NO: 18) of a porcine ML isoform (pML). This porcine mpl ligand 
isoform contains 332 amino acid residues and is believed to be the full length porcine 
mpl ligand, designated pML. Nucleotides are numbered at the beginning of each line. 
Amino acid residues are numbered above the sequence starting with Ser 1. 

Fig. 21 shows the cDNA sequence (SEQ ID NO: 20) and predicted mature protein 
sequence (SEQ ID NO: 21) of a porcine ML isoform (pML2). This porcine mpl ligand 
isoform contains 328 amino acid residues and is a four residues deletion form of the 
full length porcine mpl ligand. designated pML2. Nucleotides are numbered at the 
beginning of each line. Amino acid residues are numbered above Ihe sequence starting 
with Ser 1. 



Fig. 22 compares the deduced amino acid sequence of the full length porcine ML 
isoform pML (SEQ ID NO: 18) and a porcine ML isoform designated pML2 (SEQ ID NO: 
2 5 21 ). The predicted amino acid sequence for the pML is aligned with pML2 sequence. 
Identical amino acids are boxed and gaps introduced for optimal alignment are indicated 
by dashes. Amino acids are numbered at the beginning of each line. 
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Fig. 23 shows the pertinent features of plasmid pSVIS.ID.LLMLORF ('full length" or 
TP0 332 ) used to transfect host CHO-DP12 cells for production of CHO-rhTP0 33 2. 

Fig. 24 shows the pertinent features of plasmid pSVI5.ID.LLMLEPO-D ("truncated" 
or TP0 153 ) used to transfect host CHO-DP12 cells for production of CHO-rhTP0 153 . 

Flge. 25A, 25B, and 25C show the effect of E. co//-rhTPO(Mef 1 , 153) on 
platelets (A), red blood cells (B) and (C) white blood cells in normal mice. Two 
groups of 6 female C57 B6 mice were injected daily with either PBS buffer or 0.3ug 
E. coff-rhTPOfMef 1 . 153) OOOui sc.). On day 0 and on days 3-7 40nJ of blood was 
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taken from the orbital sinus. This blood was i mmediate , y dlluted in 10 ml of 
eon.nMre.ai diluant and complete blood counts were obtained on a Serrono Baker 
Hematology Analyzer 9018. The data arp nr,,^., serrono oaxer 

aata are presented as means ± Standard error of the 

mean. 

(AWe^' 7 T Sh ° W ^ *** °' £ "*™W, 15 3) on p.ate.ets 
rouo u w (C) Wh ' ,e b ' 00d Ce " S SUb,etha,, y radiated mice. Two 

groups o 10 female C57 B6 mice were sub.ethally irradiated with 750 cGy of gamma 

. ' 08 S ° UrCe ^ ini6C,ed dai * ^ — ' PBS »** « 3 o,g s 

«*rhTPO( M .t-1, 153) (100,1 sc.). On day 0 and at subsequent intermediate time 

b ' 00d ^ t3k9n fr ° m ^ ° rbftal S ' nUS - This b,00d «- '—diateiy 
" C ° mmerCial di ' Uant bl °<* counts were obtained on a 

Serrono Baker Hema.o.ogy Analyzer 9018. The data are presented as means + 
Standard error of the mean. ~ 

Figs. 27A. 27B and 27C show the effect of CHO-rhTP0332 on (A) p.ate.e.s 
thrombocytes), (B) red blood cei.s (erythrocytes) and (C, white biood celis 
(leukocytes) ,n norma, mice. Two groups of 6 female C57 B6 mice were infects 
daily with either PBS buffer or 0„g CHO-rhTPO^ (100a, so.Z Z 0 Z Z 

dtL'Tn 7 1 7" ^ t3ken fr0m 0rbl,a ' 8inUS - ™ S bto0d WBS 

Serrono B ^t w C ° mmBrCial d " Uam C ° mP ' e,e ^ ™« ™° on a 

Serrono Baker Hemato.ogy Analyzer 9018. The data are presented as means ± 

Standard error of the mean. 

Fig. 28 shows dose response curves for various forms of rhTPO obtained from 
vanous ce.i lines. Dose response curves were constructed to rhTPO from the 

iz^n n v jp ° 332 from cho (,un ^ ,r ° m «*- ha ~; e 

eel s ; hTPOMefl ,53 (£. co/Aderived truncated form with an N-term.nai 
methionine); hTP0 332 (Mi .ength TPO from human 293 ce. te) ; MeMess 155 E-CoH 
(the truncated form [rhTP0i 55 ] without the terminal methionine from E com 
Groups of 6 female C57B6 mice were .njected daily for 7 days with rhTPO depending 

IZTZ' w T ^ * ^ WM ,3ken ^ « h9 si ™ *» ~-P'ete blood 
count. The data presented above are the maxima, effects seen with the various 
treaunents and with the exception of (met 153 F-Co//) this occurred on day 7 o, 
*e*™«. in the aforementioned "me. 153 E-Co* group the maxfcnal effect was seen 
on day 5. The data are presented as means ± Standard error of the mean 
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rorms of rnTPO produced in CHO cells w/rth th« ♦ 

6 .ema.e C57B6 mice were inie cted ail 1 3,6(1 £ "* °' 

2-7 40„ C b ,ood was ta n to m 7 h ^ ^ °' ^ *~ °" **» 
5 Treatment groups were TPO S3 Z m ^ ** COmP,8,e b '°° d C ° U " ,S - 

Taction) Fui, length TPO To • ^ " ^ * "* ^ (M ' X 

dipped terms: SSa^^)^ " "* 

•mix- nr«n« ,• -J } = pur " ied C " Pped ,raction f ™ ^ original 

: «™;: zzzrr * pu * m ,u sm ™ *-» - 

i o mo ™,„. ** ™ """""" " m "" ! * s "™"« »' 

FIS. 30 k . showta( , „, K , RA EUSA ^ 

£i tt° * ,mera m ,e,ava ™ ~ °' - * ™ 

. 5 11! po " ion 01 ma • *- aia °™ «>« -*» - •» 

15 figure) showing relavant steps of the assay. 

Fig. 31 is a fiow chart for the KIRA EL.SA assay showing each step in the procedure. 

2 0 eTore """^ ^ nUClB ° tide (SE ° ,D N0: *> - *• l*V.17 ID LL 

20 express.on vector used for expression of Rse.gD in Example 17. 

Fig. 33 is a schematic representation of the preperation of piasmid pMP1. 
^ Fig. 34 is a schematic representation of the preperation of piasmid P MP21. 

Fig. 35 is a schematic representation of the preperation of piasmid pMPisi. 

Fig. 36 is a schematic representation of the preperation of piasmid p MP2 02. 
30 Fig. 37 is a schematic representation of the preperation of piasmid P MP172. 
Fig. 38 is a schematic representation of the preperation of piasmid P MP210. 

... i ^:::^7T a r - — — 

Fig. 40 is a schematic representation of the preperation of piasmid pMP41. 
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Fig. 41 is a schematic representation nf th Q 

at,0n 0f the Proration of pfasmid pMP57. 

Fig. 42 is a schematic representation nf th« 

presentation of the preperation of plasmid pMP251. 

5 DETAILED DESCRIPTION OF THE INVENTION 

I. Definitions 

used JlZTl'T ,0lk5Win9 W ° rdS " PhraS6S ^ *° -"Won when 

used in the description, examples, and claims. 

1 0 suitab.e"^et at a9enr r8,erS t0 " C ° mP0Und aqUS ° US «**■" - 

suitable concentrates, can cause a change in the spatial configuration or 

-tormat-on o, a protein by a, ieas, partiaiiy disrupting the forces res o , 

ma.nta.mng the norma, secondary and tertiary structure of the pro! Su h 

compo s inciude, for examp.e, urea, guanidine-HC. and sodium thiocyana.e. High 

con ent rat.ons, usua„y 4-9M, of these compounds are normally reared to exer, the 

1 5 conformational effect on proteins. 

ac, on 'IZTl' Sen °" C P, °' e ™ " lMSM * - <"0» la *" <**» 
•ct on .no*., c.11 ,„,.,„„„,,,, „,.„.,„„ 

taUM. n,„o,„„. „ Mllona , hom , on<s ^JZZZ, Z 

«. ~y, 9row , ^ 8roi>lh °™». 

o»„. ,„ su n . prolnsulln , pror(1Mn siycop;wsin y -» 

o,l,el. „,„„,»,,„„ hormon . (FSH) , hyrold «• 

« g h o, m .„. (LH) . „.„,., wtellc ^ tec ,„« B . p . 

Z!l T mT"' ,a! " :,,, " en *"'"" a ' srowm ,aa ™- "«v« ««th 

toon, su oh „ NGF-J), p, M .,.,^ ,«,„, ,„„, ^ •»* 

(EPOI. oslooinduoBve lactor.. im.rf.mn. «,d> a. w.rt mn „ . Z W ™ 

«. IH. ILL. IL-2. IUS, IL-4. .L-5. 11-e, IL-7. IL-S. IL-», lt .„ ,, L .„,, 
— po^p*. ,«,„. UF , S0F „ a J£ ^ 

in type or extent of glycosylate. 
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mp, ligand". - mp , ,i gan d polypeptide", "ML", "thrombopoielin" or TPO" are 
use interchangeably herein and comprise any polypeptide that possesses the property 
of bind.ng to mpl. a member of the cytokine receptor superfam.iy, and having . 
b,o.o g ,ca. property o, the ML as defined below. An exempiary biological property „ 
the b,«y to simulate the incorporation of iabeled nucleotides (e.g., 3 H -thymid«ne) 
-mo the DNA of IL-3 dependent Ba/F3 ce.is transfected with human mpl P. Another 
exemplary biological properly is the ability to stimulate the incorporation of 35 s in ,o 
circulating p.ate.ets in a mouse p.atelet rebound assay. This definition encompasses 
he polypeptide isolated from a mp , , igand S0U rce such as ap.astic porcine p.asma 
descnbed herein or from another source, such as another anlma. spec.es, inciudlng 
humans or prepared by recombinant or synthetic methods and includes variant forms 
inciudlng functional derivatives, fragments, alleles, isoforms and anatogues thereof 

A 'mpl ligand fragment" or "TPO fragment" is a portion of a naturally 
occurring mature full length mpl ligand 0 r TPO sequence having one or more amino 
acd residues or carbohydrate units deleted. The deleted amino acid residue(s) may 
occur anywhere in the peptide including at erther the N-termina. or C-termina. end or 
•nternaily. The fragment wil. share at least one biofogica. property in common with 
mpl l,gand. Mpl ligand fragments typically will have a consecutive sequence of at least 
10. 15, 20, 25. 30. or 40 amino acid residues that are identicai to the sequences of 
the mpl ligand isolated from a mamma, including the ligand isolated from aplastic 
porcine p.asma or the human or murine .igand, especially the EPO-domain thereof 
Representative examples of N-termlna. fragments are hML 153 or TPOfMefi 1-153) 
•Mpl ligand variants" or "mp/ ligand sequence variants" as defined herein 
means a biologically active mpl ligand a, defined below having less than 100% 
sequence identity with the mp/ ligand isolated from recombinant cell culture or 
aplastic porcine plasma or the human Hgand having the deduced sequence described In 
Fig. 1 (SEQ ID NO: 1). Ordlnariiy, a bioiogica.ry active mpl .igand variant w... have 
an amino acid sequence having at least about 70% amino acid sequence identity with the 
mpl ligand isolated from aplastic porcine plasma or the mature murine or human 
ligand or fragments thereof (see Fl 9 . 1 [SEQ ID NO: 1J). preferably at least about 
75%. more preferably at least about 80%. still more preferably at least about 85% 
even more preferably at least about 90%. and most preferably at least about 95% 

A "chimeric mpl ligand" is a polypeptide comprising full length mpl ligand or 
one or more fragments thereof fused or bonded to a second heterologous polypeptide or 
one or more fragments thereof. The chimera will share at least one biological 
property in common with mp, ligand. The second polypeptide w... typically be a 
cytokine, immunoglobin or fragment thereof. 
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"isolated mpi hgand", - highly purified 'substantially 
homogeneous mp/liaand" are ..can u suosiannaiiy 

purified from a VZtZZTTT*™™*"™"^ 

m «thoH . , h3S been prepared bv recombinant or synthetic 

method and is sufficiently tree of other peptides or proteins (1) ,o obtain a, , 
5 and P referab,y 20 amino acid residues of the N ,erminaJ or of an interna, HTJd 
sequence by using a spinning cup sequenator or the best commercially avai.ab.e amino 
acd sequenator marketed or as modified by pub.ished methods as of the fiiing date of 
th.s application, or (2) , 0 homogeneity by SDS-PAGE under non-reducing or reducing 
conditions using Coomassie blue or, preferably, sliver stain. Homogenerty here means 
1 0 less than about 5% contamination with other source proteins. 

"Biological property when used in conjunction with either the "mp/ ligand" or 
isolated mp, ligand" means having thrombotic activity or having an in vivo 
effector or antigenic function or activity that is directly or ind.rectiy caused or 
performed by a mp, iigand (whether in its native or denatured conformation) or a 
5 fragment thereof. Effector functions includemp, binding and any carrier binding 
act, y. agonism or antagonism of mp,. especiaHy transduction of a probative 
s.gna .nciuding replication. DNA regu.atory function, moduiat.on of the biologica. 
act,v,ty of other cytokines, receptor (especially cytokine, activation, deactivation 
up- or down regulation, cei. growth or differentiation and the like. An antigenic 
funct.cn means possession of an epitope or antigenic site that is capable of cross- 
reacts with antibodies raised against the native mp, .igand. The principal antigenic 
hnrton of a mp, ilgand po.ypeptide is that it binds with an affinity of a. least about 
0 Umole ,o an antibody raised agafcst the mp, Hgand iscteted from apias.ic porcine 
Plasma. Ordmari.y. the polypeptide binds with an affinity of at .east about 10? 
^o.e. Most preferably, the ant.genica.ly active mp/ ligand po.ypept.de is a 
PO^PM. that b.nds to an antibody raised against ,he mp, ,,gand having one of the 
above described effector functus. The antibodies used to define -b.olog.Ly activity 
are rabbrt po.yc.ona. antibodies raised by formu.at.ng the mp/ iigand .so.a.ed from 
recombinant cel. cufture or ap.as«ic porcine piasma in Freund's compiete adjuvant 
subcutaneous* injecting the formu.at.on. and boosting the immune response by 
muapejtonea. inject.on o, the formation until the trter of mp, Hgand ant.body 

'Biological* active" when used in conjunction with either the "mp/ „g an d" or 
.seated mp/ ^and" means a mp/ iigand or po.ypept.de tha, exhibtts thrombopoietic 
activity or shares an effector function of the mp/ Hgand teoiated from aplastic porCne 
P-asma or expressed in recombinant ceil culture deserved he re .n. A pHncipa. known 
e«ec or function of the mp/ Hgand or po.ypept.de herein .s b.nding to mp/ and 
stimu.ating the incorporation of labeled nucleotides ^-thymidine) Into the DNA of 
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IL-3 dependent Ba/F3 cells Iransfected with hi m » , « . 

, .. . 0 Wltn numan mi* P- Another known effector 

function of the mpl ligand or DolvnontHa 

„ a ra or polypeptide herein is the ability to stimulate the 
incorporation of 35s into circulate ni a »«i„. • 

circulating platelets in a mouse platelet rebound assay Yet 
another known effector function nf <■ ^ , 

h„ m3 n . P ' ' l9and is the ab,ll, y t0 stimu 'ate in vitro 

human megakaryocytopoiesis that may De quantltated by using , radio 

monociona. antibody specific to the megakaryocyte glycoprotein GP ' b „, a . 

"Percent amino acid sequence identity" with respect to the mpl ligand sequence 
•s darned herein as the percentage of amino acid residues in the candidate sequence that 
are identical with the residues in the mpi ligand sequence Isolated from aplastic 
porc.ne plasma or the murine or human iigand having the deduced amino acid sequence 
described in Fig. 1 (SE Q ,D NO: 1), after aligning the sequences and Introducing gaps 
if necessary. ,o achieve the maximum percent sequence identity, and not considering 
any conservative substitutions as part of the sequence identity. None of N-termina. 
C-termmal. or internal extensions, deletions, or insertions into the mpl .igand 
sequence shall be construed as affecting sequence identity or homology Thus 
exemplary biologically active mpl ligand polypeptides considered to have identical 
sequences Include; prepro-mp/ ligand. pro-mp/ ligand, and mature mpl ligand 

'Mpl ligand mlcrosequencing" may be accomplished by any appropriate 
standard procedure provided the procedure is sensitive enough. In one such method 
highly purged polypeptide obtained from SOS gels or from a final HPLC step are 
sequenced directly by automated Edman (pheny. isothiocyanate) degradation using a 
model 470A Applied Biosystems gas phase sequencer equipped with a 120A 
Phenylthiohydantion (PTH, amino acid analyzer. Additionally, mp„i ga nd fragments 
prepared by chemical (e.g., CNBr, hydroxyzine. ^nitro-S-thlocyanobenzoate) or 
enzymatic (e.g., trypsin, clostrlpain, staphylococci protease) digestion followed by 
fragment purification (e.g.. HPLC) may be similariy sequenced. PTH amino acids are 
analyzed using the ChromPerfect data system (Justice Innovations. Palo Alto CA) 
Sequence interpretation is performed on a VAX 11/785 Digital Equipment Co" 
computer as described by Henze. et a/.. J. Chromatography, 404:41-52 [1987]' 
Optionally, aiiquots of HPLC fractions may be electrophoresed on 5-20% SDS-PAGE 
^transferred to a PVDF membrane (ProBlott. AIB, Foster City. CA) and stained 
with Coomassie Brilliant Blue (Matsurdiara. J. Biol. CAem.. 262:10035-10038 
[1987]. A specific protein identified by the stain is excised from the blot and N- 
terminal sequencing is carried out with the gas phase sequenator described above For 
internal protein sequences. HPLC fractions are dried under vacuum (SpeedVac) 
resuspended in appropriate buffers, and digested with cyanogen bromide, the Ly* 
specific enzyme Lys-C (Wako Chemicals. Richmond. VA). or Asp-N (Boehrlnger 
Mannheim. IndianapoHs. IN). After digestion, the resultant peptides are sequenced as a 
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0 1 y TFA nrinr . u developed with a propanol gradient in 

0.1% TFApnor to gas phase sequencing 
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Thrombocytopenia- is defined as a piateie, coun, below 150 X 10» per ,fter of 

5 acceie JnoTh b e° POi r ^ " ^ " bi ° l09,Ca ' *« °< 

accelerating the prol.ferat.on, differential and/or maturation of megakaryocytes or 

megakaryocyte precursors into the p.ate.e, producing form of these ceils This 
activity may be measured in various assays including an ,n vivo mouse platelet 
rebound synthesis assay, induction of platelet cel. surface antigen assay as measured 
»V an antiplatelet immunoassay (anti-GPII b ||| a) , or « numan , eukemja 
megaloblastic cel. iine (C MK). and induction of po.ypioidizat.on ,„ a 
megakaryoblastic cell line (DAMI). 

Thrombopoietin- (TPO) is defined as a compound having tfirombopoie.ic 
activity or being capab.e of increasing serum plate.et counts in a mammal. TPO is 
preferably capab.e of increasing endogenous plate.et counts' by a, least 10%. more 
preferably by 50%, and most preferably capable of e.evat.ng P , a ,e,e, counts In a 
human to greater that 150X109 per liter of blood. 

"Isolated m P , ligand nucieic acid" is RNA or DNA containing greater man 16 and 
preferably 20 or more sequential nudeotide bases that encode bio.og«ca..y active mp, 
J: 39 " 16 " 1 there0 '' * C ° mp,ementaf V «o the RNA or DNA, or hybridizes to the 

RNA "nr. T rema ' nS S,ab ' y ^ Undef """" 10 ««*«. ™s 

RNA or DNA ,s free from at .east one contaminating source n U cle.c acid wfth which ft is 

normally associated in the nature, source and preferably substantially free of any 

other mammalian RNA or DNA. The phrase free from a, .east one contaminating source 

nuc.e,c ac,d with which it « normally associated" Includes the case where the nucleic 

acd ,s present in the source or natural cell but is in a different chromosomal location 

or ,s otherwise flanked by nucleic acid sequences not normaily found in the source cell 

An example of isolated mp, ligand nucleic acid is RNA or DNA that encodes a bio.ogica.ty 

" 9and T 9 81 ,MSt 75% SBqUenCe ' den,,ty ' m ° re « 'east 

orel, oT " 9rab,y 81 ^ 85% ' 8V8n m ° re most 
preferabiy 95% sequence Identity with the human, murine or porcine mpl ligand 

"Control sequences' when referring to expression means DNA sequences 

necessary for the expression of an operably linked coding sequence In a particular host 

organism. The contro. sequences that are suitable for promotes, for example 

.ndude a promoter, opt.ona.ly an operator sequence, a ribosome binding site, and 

7*' 0,Her " Vet P °° riy underetood *«*™°*- Eukaryotic cells are known to 
utilize promoters, polyadenylation signals, and enhancers. 
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'Operabiy linked" when referring «o nucleic acids means that the nucleic acids 
are placed ,n a functional relationship with another nucleic acid sequence. For 
example, DNA for a presequence or secretory leader Is operabiy .inked to DNA for a 
po ypep de If it Is expressed as a preproteln that participates «n the secretion of the 
polypept de ; a promoter or enhancer is operabiy .inked to a coding sequence if it 
affects the transcription of the sequence; or a ribosome binding site is operabiy linked 
to a coding sequence If it is positioned so as to facilitate translation. Generally 
"operabiy linked" means that the DNA sequences being linked are contiguous and in the 
case of a secretory leader, contiguous and in reading phase. However, enhancers do not 
have to be contiguous. Linking is accomplished by ligation at convenient restriction 
sites, if such sites do not exist, the synthetic oligonucleotide adaptors or linkers are 
used in accord with conventional practice. 

"Exogenous" when referring to an element means a nucleic acid sequence that 
Is foreign to the cell, or homologous to the cell but in a position within the host cell 
nucleic acid in which the element is ordinarily not found. 

'Cell," "cell line," and "cell culture" are used interchangeably herein and such 
delations include all progeny of a cell or cell line. Thus, for example, terms like 
■transformants- and "transformed cells" include the primary subject cell and cultures 
derived therefrom without regard for .he number of transfers. It is also understood 
that all progeny may not be precisely identical in DNA content, due to deliberate or 
.nadvertent mutations. Mutant progeny that have the same function or biological 
activrty as screened for in the originally transformed cell are included Where 
dist.nct designations are intended, it will be clear from the context. 

•Plasmids" are autonomously replicating circular DNA molecules possessing 
independent origins of replication and are designated herein by a lower case V 
preceded and/or followed by capital letters and/or numbers. The starting plasmids 
herein are either commercially available, publicly available on an unrestricted basis 
or can be constructed from such available plasmids in accordance with published 
procedures. In addition, other equivalent plasmids are known In the art and will be 
30 apparent to the ordinary artisan. 

■Restriction enzyme digestion" when referring to DNA means catalytic cleavage 
of internal phosphodlester bonds of DNA with an enzyme that acts only at certain 
locations or sites in the DNA sequence. Such enzymes are called "restriction 
endonucleases". Each restriction endonuclease recognizes a specific DNA sequence 
called a -restriction site' that exhibits two-fold symmetry. The various restriction 
enzymes used herein are commercially available and their reaction conditions 
cofactors, and other requirements as established by the enzyme suppliers are used 
Restriction enzymes commonly are designated by abbreviations composed of a capital 
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letter followed by other letters reorespntin^ ^ 

restriction enzvm* • • „ repr6Sent,n 9 the m.croorganism from which each 
restriction enzyme originally was obtainpri 3 nw *u 

particular onzyro.. ,„ I J a ,1m , , deSi9na " n9 
.bout ,-2 J. „, .1 ° °" a "°* " DNA *•»'"•"' " »">« UK 

. r» , T;:rr:: o, r r -* 1 ~ — « 

raoovorw , rom lh , ^ by wcip|tj||OT ^ 

«Wh» DNA hp« «, ». reslric , ion ss , Un , eM 0 , h9n . sii 

lo, *, p *os p r,o,y,..lon .„ co „ v ,„, iona| „ |n sMtens » 

£77" "- * tM "'"" "»""' W- V.* WO- S„* 9 

Harbor Laboratory Press, 1989]. 

mean* " R9 T ° f ' iS ° ,a,i0n " °' 8 9iVen fra9ment * DNA '™ ■ -stolon digest 

2TZ mant °' in,erest by comparison o< i,s mobii ^ — *■» - 

ma ke DNA fragments of known mo.ecu.ar weigh., remova. of the ge , section 
coning , hs desired fragment, and separation of the ge, from DNA. This procTr 
known generany. For examp.e. see Lawn et al ., Nudeic Acids ^ * 
[1981], and Goedde. et al.. Nucleic Acitis Res., 8:4057 [1980] 

of DNA r hem anafyS,S " ° r " S0Uthem bk,tt,n9 " * 8 m8th0d * *• P-sence 
of DNA sequences ,„ a restriction endonuc.ease digest of DNA or DNA-containin^ 

compose is confirmed by hybridization to a *ow, labeled ollgc.uc.eo Z^Z 
ragmen, Southern ana, ysis ty plcalIy involve8 ™ 

d-gests on agarose ge«s. denaturation of the DNA after e.ectrophoretic separation and 

r:! 0 ' ,h ; h DNA to ntooce,,u,o8e ' ^ - — * «— 

for wrth a radio.abe.ed. biotlnylated. or enzyme-labeled probe as described ,n 

sections 9.37-9.52 of Sambrook et al., supra. 

•Northern analysis" or -Northern Wotting- , 8 « method used to identify RNA 

zrr at hybrid,ze to a known probe such m " ^ 

srasT^r 0 '' ° f ^ ^ Probe * ■*« * a 
ZZ 7 7 ^ " ^ " 6nZyme - ™« RNA to be anaiyzed , usuaity 
electrophoretlcally separated on an agarose or pofyacry.am.de ge.Tansferred to 
n„oce„ U ,ose, nyion. or oth tab.e membrane, and hybridized ^11 



-31 



25 



30 



35 



WO 95/18858 

PCT/US94/14553 

using standard techniques well known in thn 

7.39-7.52 of SamoJ et s, " ^ d9SCribed SeCti ° nS 

acid J^F 7 T" °' ^ Ph0Sph ° d ^' bond, between two nucleic 

5 o! 7 H 9 " °' ^ *" ,ra9men,S ' fh9 - *• '-gments must be 

5 compatible with each other In At _ 

endonuciease ^ t J^Z^ "* ^ *"* C ° mPa " b,e 

. , "owever, it may be necessary first to convert the staggered 

oirr, : produced af,er endonuc,ease * — - — «^ 
for a z i* For b,un,ing ,he ends - me dna 18 — * * — * 

> 111 m ' 68 ^ 15 °° W ' th *"* 10 Un " 8 °' the K,an - fra 9—t of DNA 

3ZTT P ° ,ymeraSe "* PreSenC ° ° f 1,18 ,0W ^-Ceotide 

osp ates. The DNA is then purified by phenol-chioroform extraction and e.hanol 

Z T ,ra9men,S "* " * *~ *»— are put in so,ution in 

about equ.mo.ar amounts. The so.ution wii. aiso contain ATP, ilgase buffer, and a 

gase , sue* as T4 DNA Ifcase at about io un«s per 0.5 ng of DNA. , the DNA * to be 
l-ga ted into a vector, the vector .s firs, iinearized by digestion with the appropriate 
restneon endonuc.ease(s>. The iinearized fragment is then treated with bacteria, 
alkahne phosphatase or calf intestinal phosphate to prevent self-lotion during the 
ligation step. 

-Preparation- of DNA from cells means isolating the plasmid DNA from a 

a" "I! ^ .""I COmm ° nly USBd ^ ,W ° NA - ^e fcrge- 

and small-scale plasmid preparations described in sections 1.25-1.33 of Sambrook* 

,1 a 7 ra h Af,6 ; h Preparation - *• ONA, it can be purified by methods wei, Known in 
the art such as that described in section 1.40 of Sambrook et a/., supra 

-Oliflonuceotides- are short-leng.h, sing... or doub.e-stranded 
polynucleotides that are chemically synthesized by known methods (such as 
Phosphotriester phosphite, or phosphorate chemistry, using soild-phase 
echn.quea such as described In BP 266.032 published 4 May ,988 or via 
deoxynuc.eoside H-phosphonate intermediates as described by Froehler eta,.. Nuc, 

ZoTL^rZ l1986,) - FU,thar M ^ "» ■"*»"« chain 

reason defined below and other autoprimer methods and oiigonucieotide syntheses on 

TZTZZ? *~ method8 are described * Eng * 8 * *~ *»• 

1 J! (1989) ' ^ meth0d8 ar ° « d " «" ™*»* add 

sequenoBM tf .he gene is known, or the sequence of the nucieic add complementary to the 

coding strand Is availabie. Attentively, if the target amino add sequence is known 

one may infer potential nuctoic acid sequences using known and preferred coding 

re idues for each amino acid residue. The oligonucleotides are then purified on 

polyacrylamide gels. 
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Polymerase chain reaction* or "POP- r«f«„ . 
whirh m i„. . e,ers t0 a P^edure or technique in 

which minute amounts of a SDecific ni«r» *t , ■ 

,.. u specmc piece of nucleic acid, RNA and/or DNA are 

amplified as described in U S Patom m 

b - Patent Na 4 .683.195 issued 28 July 1987. Generalfy 
seq ence ^formation from the ends of the re g ion o, interest or beyond needs to 

° ,i90nUCl60,ide ~» bS ^ P"--s I 

den^ or s,m ,lar ,n sequence to opposite strands of the tempiate to be ampiified. The 

JZ ° f ^ Prim8rS ^ C ° inCide With the — * ampiified 
matena . PGR can be used to ampiify specific RNA sequences, specific DNA sequences 
from totai genomic DNA. and cDNA transcribed from tota. celiuiar RNA, bacteriophage 
or piasmid sequences, etc. See genera.iy Mu.iis et a,.. Co,, Spring HarPor Symp. 
Ouan- ^,.,SV.2S 3 r lfl87]: Er.ich, ed.. POP, Tecnno,ogy, (Stockton Press. NY, 
1989). As used herein, PGR is considered to be one. but not the only, example of a 
nuclei acid poiymerase reaction method for amplifying a nucleic acid test sample 
compnsing the use of a known nucieic acid as a pnmer and a nucieic acid poiymerase 
i 5 to amplrfy or generate a specific piece of nucleic acid. 

'Stringent conditions" are those that (1) emp.oy low ionic strength and high 
temperature for washing, for example. 0.015 M NaCI/0.0015 M sodium citrate/0 1-/ 
NaDodS0 4 (SDS, at 50-C. or (2) emp.oy during hybridation a d^^^ 
as formamide. for example, 50% (vo./voi) formam.de with 0.1% bovine serum 
20 a ,bum,n/0,% Rco.,0,% P o,yviny, P yrro.idone/50 mM sodium phosphate buffer at 
pH 6.5 with 750 mM Nad. 75 mM sodium citrate at 42'C. Another examp.e is use of 
50/. formamide, 5 x SSC (0.75 M Nad. 0.075 M sodium citrate,. 50 mM sodium 
Phosphate (pH 6.8). 0,% sodium pyrophosphate, 5 x DenhardTs so.ution. sonicated 
salmon sperm DNA (50 .g/ml). 0.1% SDS, and 10% dextran sulfate at «-C. with 

2 5 washes at 42'C in 0.2 x SSC and 0.1 % SDS. 

'Moderately stringent conditions- are described in Sambrook et a,., supra, and 
nc ude the use of a washing solution and hybridization conditions { e.g., temperature, 
■on* strength, and %SDS , less stringent than described above. An exampie oi 
moderately stringent conditions are conditions such as overnight Nation at 37-C in 

3 0 a solution comprising: 20% formamide, 5 X SSC (150 mM NaCI, 15 mM tnsodium 

ctrate). 50 mM sodium phosphate ( P H 7.6). 5 X Denhardfs solution. 10% dextran 
sutete, and 20 ul/m. denatured sheared saimon sperm DNA, followed by washing the 
»~ in 1 X SSC a, about 37.50'G. The skilied art.an wi„ recognize how to Z£ 

35 :; g ;™r etc - as necess " ,o — - - *~ 

"Antibodies- (Abs) and "immunoglobulins- (Igs) are glycoproteins having the 
same structural characteristics. While antibodies exhibit binding specificity to a 
specie antigen, immunoglobulins include both antibodies and other antibody-like 
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mo.ecuies which lack antigen specificity. Polypeptides o, the ,atter kind are for 
example, produced at low levels bv th« i™nh . 

myelomas. " ^ "* * increasad lavela * 

"Native antibodies and immunoglobulins' are usually he.erotetrameric 

' SIT h 15a0 °° dalt0nS> * «" """«"*» « — - 

*o denial heavy (H) chains. Each Ugh, chain is linked to a heavy chain by one 
"'sulfide bond, while the number of disuifide ..nkages val between Z 
eZ di " erent immUn ° 9,0bulin heavy and ..ght chain also has 

^ diSU,fide brid9eS ' ^ h6aVy Chain has * « 
vanable domain (V H ) foHowed by a number of constant domains. Each light chain has a 

vanable domain at one and (VJ and a constant domain at its other end; the constant 
domain of the light chain is aligned with the first constant domain of the heavy chain 
and the light chain variable domain is aligned with the variab.e domain of .he heavy" 
chain. Particular amino acid residues are beHeved to form an interface between the 
light and heavy chain variable domains (Clothia et a/., j; uoL Biol., 186:651-663 
[1985]; Novotny and Haber, Proc. Nat,. Acad. Sci. USA. 82:4592-4596 [1985]) 

The term "variable' refers to the fact that certain portions of the variable 
domains differ extensively in sequence among antibodies and are used in the binding and 
spec-fcity of each particular antibody for its particu.ar antigen. However, the 
vanabNity ,s not evenly distributed through the variab.e domains of antibodies It is 
concentrated in three segments called complementarity determining regions (CDRs) or 
hypervariable regions both in the Ugh, chain and the heavy chain variable domains 
The more highly conserved portions of variab.e domains are oal.ed the framework 
Th8 Variab ' e d0mains of natlve and light chains each comprise four FR 

rag,ons. largely adopting a p-sheet configuration, connected by three CDRs. which 
form loops connecting, and In some cases forming part of. the p-sheet structure. The 
CDRs ,n each cha,n are he.d together in dose proximity by the FR regions and. with the 
CDRs from the other chain, contribute to the formation of the antigen binding site of 
anybodies (see Kabat et a,.. Seances of Proteins of i^unofogica, Merest. National 
nstttute of Health. Bethesda. MD [1987],. The constant domains are not Invoived 
directly in binding an antibody to an antigen, but exhibit various effector functions 
such as participation of the antibody in antibody-dependent cellular toxicity 

can*. ZT, di9B8,,0n * amib ° di8S Pr ° dUC6S l "° "ndhg fragments, 

called Fab fragments, each with a single antigen binding elte. and a residua, 'FC 

fragment, whose name reflects its ability to crystallize readily. Pepsin treatment 
y.e.ds an F(ab-) 2 fragment that has two antigen combining sHes and is « o^STS 

cross-linking antigen. 
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racoon a ;;i: n :rr: y ,ra9ment mch — * — 

chain variable domain C ° nS ' S,S °' 9 ^ °' ^ ^ °" e 

.hat ,he three CoZrt J I ^ " iS «" *• 

« y t0 the anti ; y L :: inrr*; six CDRs co ,i9en bindina 

comprising only three CDRs sol r ' S,n9 ' e Vanab ' 8 domain •»» of an Fv 

bind .„,- ! P ' C ,0f a " anti9en) has ,he ^i'ity to recognize and 

b.nd an„gen. a.though at a ,ower affinity than the entire binding site 

) first JJLTh ,ra9 ' T,e^, a,S ° C ° n,ainS me d ° main ° f <h9 '* ht chai " <"d the 

■rst constant doma.n (CH,) of the heavy chain. Fab' fragments differ from Fab 

CH doma,n ( nc,ud,ng one or more cysteines from the antibody hinge region. L-SH 

ar a « h ,ol group. F(ab') 2 antibody fragments originally were produced 
pa.rs of Fab' fragments which have hinge cysteines between them. Other chemical 
coupling, of antibody fragments are also known. 

The -light chains' of antibodies (immunoglobulins) from any vertebrate 

TIT "I aSSi9 " ed ,0 " *~ ~* « kappa an d atbl 

(X). based on the am.no acid sequences of their constant domains 

chain, DePendi T " amin ° S8qUenCe °' me of their heavy 

chains, .mmunoglobullns can be assigned to different Casses There are five mZ 
classes of immunoglobulins: IgA. IgD IgE l«G and InM T J 
f-her divided into subclasses isotyp^ * tT» ,TH " ^ * 
1 and,g A - 2 . The heavy chain constat « 
of immunogiobulins are called a, delta epsilon y aT 

structures and th». * lm Y ' respectlve,v - The «*unlt 

siructures and three-dimensional configurations of dlff*r«nt .1 

immunoglobulins are well known. ' C ' aSSeS 0f 

The term "antibody- is used in the broadest sense and specifically covers *«. 
monoclonal antibodies (including aaonist and „«♦ • ^ 
compositions w«h polyepLpic s HZ L we, " ZZ V"* ^ 
and Fv). so ,ong as ^^^J^S^T ^ 

from . 7h * T " m0nOC,0na ' ■ nMb °*" 38 USed here,n re,ere «o antibody obtained 
from a popuiation of substantia.lv homogeneous antibodies /. ,h» JZT^ 
antibodies comprising the popuiation are identica. e«ep 0 r DOS8 b 
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monoclonal antibody is directed against a single determinant on the antigen. In addition 
to their specificity, the monocional antibodies are advantageous in that they are 
synthesized by the hybridoma culture, uncontaminated by other immunoglobulins. The 
modifier "monoclonal" indicates the character of the antibody as being obtained from a 
5 substantially homogeneous population of antibodies, and is not to be construed as 
requiring production of the antibody by any particular method. For example, the 
monoclonal antibodies to be used in accordance with the present invention may be made 
by the hybridoma method first described by Kohler & Milstein. Nature, 256:495 
(1975), or may be made by recombinant DNA methods (see, e.g., U.S. Patent No 
1 0 4,816,567 [Cabilly et a/.]). 

The monoclonal antibodies herein specifically include •chimeric" antibodies 
(immunoglobulins) in which a portion of the heavy and/or light chain is identical with 
or homologous to corresponding sequences in antibodies derived from a particular 
species or belonging to a particular antibody class or subclass, while the remainder of 
the chain(s) is identical with or homologous to corresponding sequences in antibodies 
derived from another species or belonging to another antibody class or subclass, as 
well as fragments of such antibodies, so long as they exhibit the desired biological 
activity (U.S. Patent No. 4.816.567 (Cabilly et a/.); and Morrison et al., Proc. Natl. 
Acad. Sci. USA, 81:6851-6855 [1984]). 

•Humanized" forms of non-human (e.g., murine) antibodies are chimeric 
immunoglobulins, immunoglobulin chains or fragments thereof (such as Fv, Fab, Fab' 
F(ab') 2 or other antigen-binding subsequences of antibodies) which contain minimal 
sequence derived from non-human immunoglobulin. For the most part, humanized 
antibodies are human immunoglobulins (recipient antibody) in which residues from a 
complementary determining region (CDR) of the recipient are replaced by residues 
from a CDR of a non-human species (donor antibody) such as mouse, rat or rabbit 
having the desired specificity, affinity and capacity. In some instances, Fv framework 
residues of the human immunoglobulin are replaced by corresponding non-human 
residues. Furthermore, humanized antibody may comprise residues which are found 
neither in the recipient antibody nor in the imported CDR or framework sequences. 
These modifications are made to further refine and optimize antibody performance. In 
general, the humanized antibody will comprise substantially all of at least one. and 
typically two. variable domains, in which all or substantially all of the CDR regions 
correspond to those of a non-human immunoglobulin and all or substantially all of the 
FR regions are those of a human immunoglobulin consensus sequence. The humanized 
antibody optimally also will comprise at least a portion of an immunoglobulin constant 
region (Fc). typically that of a human immunoglobulin. For further details see: Jones 
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et a/., Nature, 321:522-525 fiqpfii. D • ,_ 

inaet and P,. s „. c „„ % - «■ 3»: 32 3. 329 

ourr. up. sfrtycf. Biol., 2:593-596 (19921) 

5 appropriate J.T.T 3 "»"»>•"«=.•* -f«v. amount wlth „. 

II. Preferred Embodiment, of the Invention 

' 0 Pt.lerred polypeptide, 0| „„ towta , ,„ subsla „ t| h 

C™, rT r p9ny 01 s "" ,ukieno ^"o™""" - «— — °»*» 
, s lt P r , °' ,l " 3 " op9nd,m ^ •* •«-««' •» «™.» 

P. Mo,, p,e,.„. d m , , igand(! , .,. ^ m 

e™..po,.„ c , . speclally m , 9akaryocylowletle or lhrom6ocy , 0 ; let]c , ~ 

nan,.,,. be,„ g capab.e o, slimuatag m «.,.„,, L 

* MFOiM. . truncated form „, hMl. r«^ ,„ , s ^ or '» 
mature u potvpepad. h . vlng « 
ID NO. „. re.er.d „ a. hMl. „ Ml33 , 0, „TP0 332 end m . bloloej,, 
substitutional variant hML(R15 3 A, R154A). 

Optional preferred po,,p.p„de, „ , n> „ lta) 

rrrtrirr - 
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Optional preferred polypeptides of this invention are biologically active mo, 
-gand variants, that have an amino acid sequence having at leaa ^ ^ 2, 
sequence Identity with the human mp/ , igan d (see Fig. 1 [SE Q ID L it T 
mpl llgand (see Fig. 16 [SEQ ID NOS- 12 4 L 1 D ' mU " ne 

(see Fl« 10 reen ,n 1 D ' ** "^""a* Porcine mp/ ligand 

(see Fig [SEQ ID NO: 18]) or the porcine mp/ ligand Isolated from aplastic 
porcine plasma, preferably at least 75%, more preferably at least 80* JT 

preferably at least 88%. even more preferably at .east Z [ 7 

least 95%. ^raoiy at . east ^ fln(J mosf preferab|y ^ 



•37- 



WO 95/18858 



PCT/US94/14553 



The m P , llgand isolated from aplastic porcine plasma has the following 

characteristics: 

-.h pp<! d« Partla,,y ^""^ ,i9and e,Ut6S from a m ^n column run in 
e,ther PBS, PBS containing 0.1% SDS or PBS containing 4M MgCI 2 with Mr of 
5 60,000-70,000; 

< 2 ) The ligand's activity is destroyed by pronase; 

( 3 ) The ligand is stable to low pH (2.5), SDS to 0.1%, and 2M urea; 

( 4 ) The ligand is a glycoprotein, based on its binding to a variety of lectin 
columns; 

10 ( 5 ) The highly purified ligand efutes from non-reduced SDS-PAGE with a 

Mr of 25,000-35,000. Smaller amounts of activity also elute with Mr of -18,000- 
22,000 and 60,000; 

( 6 ) The highly purified ligand resolves on reduced SDS-PAGE as a doublet 
with Mr of 28,000 and 31,000; 

15 (7) The amino-terminal sequence of the 18,000-22,000, 28,000 and 

31 ,000 bands is the same - SPAPP ACDPRLLNKLLRDDH VLHGR <SEQ ID NO: 29); and 
( 8 ) The ligand binds and elutes from the following affinity columns 
Blue-Sepharose, 
CM Blue-Sepharose, 
2 0 MONO-Q, 

MONO-S, 

Lentil lectin-Sepharose, 
WGA-Sepharose, 
Con A-Sepharose t 
25 Ether 650m Toyopearl, 

Butyl 650 m Toyopearl, 
Phenyl 650m Toyopearl, and 
Phenyl-Sepharose. 

More preferred mp/ligand polypeptides are those encoded by human genomic or 
3 0 cDNA having an amino acid sequence described in Rg. 1 (SEQ ID NO: 1). 

Other preferred naturally occurring biologically active mpl ligand 
polypeptides of this invention include prepro-mp/ ligand, pro-mp/ ligand, mature mpl 
ligand, mpl ligand fragments and glycosylate variants thereof. 

Still other preferred polypeptides of this Invention include mpl ligand sequence 
variants and chimeras. Ordinarily, preferred mpl ligand sequence variants and 
chimeras are biologically active mpl ligand variants that have an amino add sequence 
having at least 70% amino acid sequence identity with the human mpl ligand or the mpl 
ligand isolated from aplastic porcine plasma, preferably at least 75%, more 
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rir r more pre,erab,y * ,Mst 8s% - - — ~ - 

vanant b a N,erm,nal domain hML variant (refered ,o as the "EPO-domain» bede 

5 i r q 7 ce homo, ° 9y ,o ery,hrop ° ietin) - The ^ ^ e o^: 

™L 15 . An op„ona-,y peered hML sequence variant comprises one in which one " 

sublV " aSiC " d ' baSiC am,n ° fe5idU8(s ) *° C^W domain , s 
8 " 0n - baSiC 3min0 ^ -*"W neutral, a.dic 

aromat,. Giy, Pro and the ii k e). A prefered hML C-termina. domain sequence variant 

ThrZnt 0 " 9 ^ ^ r6SidUeS 154 " "» A <* 

Th v r, ant ls refered , 0 as hML332(R153A R154A) ^ aj(emat ^ prefered h ^ 

vanant compnses either hML 332 or hML 153 in which amino residues 111-H4 
AGAG 0 °r r ,h P ^ are T de ' e,ed " r6PlaCed ^ 3 di,er6nt ~Mnc«.,. 
1 5 ,0r690in9 de,6,l0n ^ ^ ref6red t0 " " hML332 * 

A preferred chimera is a fusion between mp/ .igand or fragment (defined 
below, ther eof with a hetero.ogous polypeptide or fragment thereof. For example 
hML 153 may be fused to an .gG fragment to improve serum ha.f,i»e or to .L 7t 

20 ZZZ a mo,ecu,e wi,h inhanced thro ~ or — 

An alternative pre.erred human mp/ .igand chimera is a "ML-EPO domain 
cWa' tha, consists of the N-terminus 153 to ,57 hML residues substftuted 1 

2 5 S 1 ^ ,h ' S emb ° diment ' the hML Chinwa °e about ,53- 

2 5 166 rescues ,n .ength in which individual or bfccKs of residues from the human eZ 

sequence are added or substituted into the hML sequence a, positions correspond^ to 

12ZT 7" ^ 10 (SE ° ' D EXemP '^ W °* ■"■«"- 

I o me N-terminus portion o, hML would inCude one or more* the N-g.ycosy,a«on 

30 edi t / 0Siti0nS (EPO) ^ 38 " 40 ' " - - more of Te t 

3 0 predicted amphipathlc o-heiica. bund.es at positions (EPO, 9-22. 59-76 90-107 

teir 152 : ^ ° ,her h ' 9h,y ~ ^ - vermin » 1 C 

terminus regwns and residue positions (epo, 44-52 (see e.g. Wen et a, aJL 

1993,). ,« ,s contempiated this 'ML-EPO domain chimera" wii, have mixed 
15 thrombopoietic-erythropoietic (TEPO, biological activity 

hav.no ^ °' lhis in ™ «on Inciude mp/ ligand fragments 
ha*ng a consecutive sequence of a, .east 10. 15. 20 . 25 . 30. or 40 amino acid 
rescues that are identica, to the sequences of the mp/,i gand isolated from a J s 
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porcine plasma or the human mp, ligand described herein (see e.g. Table 14 
Example 24). A preferred mp, ligand fragment is human ML[1-X] where X is 153 
164, 191, 205, 207, 217. 229, or 245 (see Fig. 1 [SEQ ID NO: 1] for the sequence 
of residues 1-X). Other preferred mpl ligand fragments include those produced as a 
result of chemical or enzymatic hydrolysis or digestion of the purified ligand. 

Another preferred aspect of the invention is a method for purifying mpl ligand 
molecules comprises contacting a mpl ligand source containing the mpl ligand 
molecules with an immobilized receptor polypeptide, specifically mpl or a mpl fusion 
polypeptide, under conditions whereby the mpl ligand molecules to be purified are 
selectively adsorbed onto the immobilized receptor polypeptide, washing the 
•mmobilized support to remove non-adsorbed material, and eluting the molecules to be 
purified from the Immobilized receptor polypeptide with an elution buffer. The source 
containing the mpl ligand may be plasma where the immobilized receptor is preferably 
a mpAIgG fusion. 

Alternatively, the source containing the mpl ligand is recombinant cell culture 
where the concentration of mpl ligand in either the culture medium or In cell lysates 
is generally higher than in plasma or other natural sources. In this case the above 
described mpAIgG immunoaffinity method, while still useful, is usually not necessary 
and more traditional protein purification methods known in the art may be applied. 
Briefly, the preferred purification method to provide substantially homogeneous mpl 
ligand comprises: removing particulate debris, either host cells or lysed fragments 
by, for example, centrifugation or ultrafiltration; optionally, protien may be 
concentrated with a commercially available protein concentration filter; followed by 
separating the ligand from other impurities by one or more steps selected from; 
immunoaffinity. ion-exchange (e.g., DEAE or matricies containing carboxymethyl or 
sulfopropyl groups). Blue-Sepharose. CM Blue-Sepharose. MONO-Q, MONO-S, lentil 
lectin-Sepharose. WGA-Sepharose. Con A-Sepharose. Ether Toypearl. Butyl Toypearl. 
Phenyl Toypearl. protein A Sepharose. SDS-PAGE. reverse phase HPLC (e.g., silica gel 
with appended aliphatic groups) or Sephadex molecular serve or size exclusion 
chromatography, and ethanol or ammonium sulfate precipitation. A protease inhibitor 
such as methylsulfonylfluoride (PMSF) may be included in any of the foregoing steps 
to inhibit proteolysis. 

In another preferred embodiment, this invention provides an isolated antibody 
capable ol binding to the mpl ligand. A preferred mpl ligand isolated antibody Is 
monoclonal (Kohler and Milstein, Nature, 256:495-497 [1975]; Campbell. 
Laboratory Techniques In Biochemistry and Molecular Biology, Burden et al., Eds.' 
Volume 13, Elsevier Science Publisrers, Amsterdam [1985]; and Huse et al., Sdence' 
246:1275-1281 [1989]). Preferred mpl ligand isolated antibody is one that binds 
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to mpl ligand with an affinity of at inaot „k 4 
♦■k A > . ^ * at least about ^6 |/mole. More preferably the 

aTl r , VMP ' k,, ,n " an " boay '" !i0 " «""»' "° >» 

POIVP.PM.. in , tone, p,., ema aspM „, Ms 8mboalmen , inv8ntk)n ^ 
m.m M , or del , ct ,„ 9 „ e |igaM (n v(m or (n ^ ^ 

a**, „,„, , samplei , semm sjmple> suspecte(| o( ^ ^ 

i 0 and detecting if binding has occurred. 

In still further preferred embodiments, the invention provides an isolated 
nuc e lc acid molecule encoding the mp, ligand or fragments thereof, which nucleic acid 
mo ecule may be labe.ed or unlabeled with a detectable moiety, and a nucleic acid 
molecule having a sequence that is complementary to. or hybridizes under stringent or 
moderated stringent conditions with, a nucleic acid molecule having a sequence 
encodmg a mp/ ligand. A preferred mp, ligand nucleic acid is RNA or DMA that encodes 
a .ologica.lv active mp/ iigand sharing at leas. 75% sequence identity, more 
prefab y at least 80%, still more preferably at least 85%, even more preferably 
90 .. and most preferab.y 95 % sequence identity with the human mp, ligand. More 
2 0 preferred .so.ated nudeic acid mo.ecu.es are DNA sequences encoding biological* 
act.e mp/ ,ig and , se.ected from: (a) DNA based on the coding region 0, a mammaiian 
mp/ gand gene (e.g., DNA comprising the nucleotide sequence provided in Fig. 1 (SEQ 
N ° : 2)> ° r fra9mentS ,hereo '>; < b > DNA capable of hybridizing to a DNA of (a) 
under at least moderate* stringent conditions; and (c) DNA that fe degenerate to a DNA 
2 5 defmed ,n (a) or (b) which resu.ts from degeneracy of the genetic code .t is 
contemplated that the nove. mp/ligands described herein may be members of a family 

ZT'nlVT ^ 8Ui,ab,e 8eqU9nCe *" ^ DNA T 

w th the DNA of F, g . . (SE Q ,D NO: 2 , (or the compiement or fragments thereof) 

under low to moderate stringency conditions. Thus a further aspect of this invention 
30 .ncludes DNA that hybrtdfces under fcw to moderate stringency cond.Uons ZiZ 
encoding the mpl iigand polypeptides. 

in a further preferred embedment of this Invention, the nucleic acid molecule 

TnT! ""V *• mP/ " 9and ' Urther 8 rep,iCab,e — <" which the 

35 1 ; P h ! " ^ ,0 ~** SeqUenC6S reC09ni2ed * * — transformed with the 
vector This aspect further inciudes host cells transformed with the vector and a 
mettod o» using the cDNA ,0 effect production of mp,|, ga nd. comprising expressing the 
CDNA encoding the mp/ ligand in a cutture of the transformed host ceHs and recovering 
the mp/ hgand from the host cel. culture. The mp/ ligand prepared In this manner I 
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preferab.y substantia.* homogeneous human m p/ , igand . A preferred host fQf 
produong mp, l igand is Chinese hamster ovary (CHO) cells 

The invention lurther includes a preferred method for treating a mamma, 
v,n g an .mmuno.ogica. or hematopoietic disorder, especial* throLcytopZ 
cornpr^ng adm.n.stering a therapeutic^ effective amount of a mp„ig ana - to the 
mammal. Opt.onally, the mp/ ligand is administered in combination with a cytokine 
especally a co.ony stimulating factor or interteukin. Preferred colony stimulating 
factors or interleukins include; klt-ligand, LIF, G-CSF. GM-CSF, M-CSF EPO IL 1 
IL-2, IL-3, IL-5, IL-6, IL-7, IL-8. IL-9 or IL-11. 

III. Methods of Making 

Platelet production has long been thought by some authors to be controlled by 
mul.i P .e .ineage specific humoral factors. » has been postulated that two distinct 
cytokine activities, referred to as megakaryocyte colony-stimu.ating factor (meg- 
CSF and thrombopoietin, regu.a.e megakaryocytopoiesis and thrombopoiesis 
m -ams j. C e„ P^oA, 1 1 0:101-104 [1982,; wil.iams et a,., a/ood 

Ce„s, 15:123-133 [1989]; and Gordon et a,., a/cc*. 80:302-307 ng 921) 
According to this hypothesis. meg-CSF stimulates the pro.iferation of progeny 
megakaryocytes while thrombopoietin primarily affects maturation of more 
d.fferentiated cells and ultimately platelet release. Since the 1960's the induction and 
appearance of bom meg-CSF and thrombopoietin activities in the plasma, serum and 
urine of animals and humans following thrombocytopenic episodes has been well 
documented (Ode,, et a, Proc. Soc. Exp . Biol . Med „ w ^ ^ 

49 TSTT' 54:34 °' 344 [1975]: SP6Cter - ^ SOC - **>>• 108:146- 
149 [1961]; Schreiner et a/., J.CIin.invest. 49:1709-1713 [1970]; Ebbe. Blood 
4 ,05.608 [1974,; Hoffman et a,., N . Engl . j. ^ ^^t 

M^m 22-1127 [19881: ^ 01 ^'..13M164 

i 9ta '- J C " n - «:733.741 [1981,; Sheiner et a/.. BiooO 

56: 83-188 [1980]; H„, e, a,, E*p. Hemafo/.. 20=354-360 [1992,; and Hegy, . 
al. Int. J. Cell Cloning, 8:236-244 [1990]). These actMt,es were reported to be 
neage 8peclflc and ^ ^ ^ ^ r j ^ ^ 

(1992); Erlckson-MHIer C.L. et al.. Br,t. J. Haemato,.. 64:197-203 (1993)- 

rZ a J' E ; ^ fl '" EXR - Hemat0L 20:4750 ^ — Tsukada J. et a,.. Blood 

35 fr m^I T^' """^ » ^ ™*™ or thrombopoietin 

3 5 from thrombocytopenic plasma or urine have been unsuccessful. 

Consistent with the above observations describing thrombocytopenic plasma 
we have found that aplastic porcine plasma (APP) obtained from .rradiated pigs 
simulates human megakaryocytopoiesis /„ vitro. We have found that this stJmu,atory 
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activity is abrogated by the soluble extracellular Hn™- , 

* i wKracenuiar domain o c-mp , confirmina APP a« 

a potential source of the DutatKm - •. wniirming mkk as 

the „p, iigand , rom APP am ^ We h ™ ™ — ^ ^ 

murJ „„ • ' n0 3Cid Sequence information was used to isolate 

murine, porcine and human ML cDNA Th»« mi . u 

> erythropoietin and have both me g -C SF and h b S6qU6nCe h0m ° ,09y * 

meg csf and thrombopoietin-like activities. 

t,mu at«y ,ac,or „ as prevlously 6 , en rwM i50|MM ^ " 

Z.T2 T T wa VM * ™ asu,i " s 3H " hyml< "- «• 

n.. 2. app 3„, hvmiaine lneorporatlo „ , Mo 

~2 „ bserv9(1 , n nomla , porcln9 ^ Thtse J =h 
t - * - **■ * — » - *pp mi. l»^JZ?Z 

stimulator, .ff«a, 0 I APP or, B«/F3-mp; cells. 
Ths acM, in 4PP „ p . arM „ „ , 

destroy the activity in APP fFln r n, . ' ' or neat 

y cimiy APP (Fig. 3 ). The activity was also non-dialyzable The 
v-ty was. however, stabie to iow pH (pH 2 , tor 2 hrs , and was shown to bind and 
elute from several lectjn . affinjty co|umn8i jndicatjng (hat g(ycoprotejn da ; o d 
^ er lucdate ^ stmcture ^ jdent|ty of thjs actjvjty j( J™ 

from APP using a m P HgQ chimera. 

Brie,.v IT TrT aCC ° rdin ° '° ^ ^ "* ,0rth in E "«""" 1 «- a. 

(H C 2oM ST PUfified ^ hydr ° Ph0biC ""»"■" chromatography 
H c. .mmobi.ized dye chromatography, and mpAaffinity chromatography The 

provided in Tab.. 1. The overall recovery of activity through the mpZ-affinitv 
coiumn was approximate^ 10%. The peak activity »ract.on L frorTl " 

purine «o„ from 5 .iters o, APP was approximate,, 4 x1 0 6 fold (0 8 unJZt 
3-3 x 06 unit8/mg) wjth a a3 x 1o6 . fo|d reductjon ^ ( n^/mg to 

zzrjzr activi,y of ,he ,i9and e,uted from - ~ ~ 
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Protein was determined by the Bradford assay. Protein concentration of mpAeluted fractions 
5-7 are estimates based on staining intensity of a silver stained SDS-gel. One unit is defined 
as that causing 50% maximal stimulation of Ba/F3-mp/ cell proliferation. 
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Analysis of eluted fractions from the mpl affinity column by SDS-PAGE (4- 
20%, Novex gel) run under reducing conditions, revealed the presence of several 
proteins (Fig. 5). Proteins that silver stained with the strongest intensity resolved 
with apparent Mr of 66,000, 55.000. 30,000. 28,000 and 18.000-22 000 To 
determine which of these proteins stimulated proliferation of Ba/F3-mp/ ce.l 
cultures, the proteins were eluted from the gel as described in Example 2. 

The results of this experiment showed that most of the activity eluted from a 
gel sl.ce that included proteins with Mr 28,000-32.000. with lesser activity eluting 
in the 18,000-22.000 region of the gel (Fig. 6). The only proteins visible in these 
regions had Mr of 30.000. 28.000 and 18.000-22.000. To identify and obtain 
protein sequence for the proteins resolving in this region of the gel (i.e. bands at 30 
28 and 18-22 kDa). these three proteins were electroblotted to PVDF and sequenced as" 
described .n Example 3. Amlno-terminus sequences obtained are provided in Table 
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TABLE 2 

Mpl Liqanri Amlno.T*rmin„ a 



Sequences 



30 kDa 

1 5 



10 



15 20 25 

(S)PAPPA(C)DPflLLM^i i BDDfH/ s)VLHi 



28 kDa 

1 5 10 15 20 25 

(S|PAPPAXDPRLIMKi ,LflDD(HlVLMfl B 



(G)RL 



18-22 kDa 

1 5 10 
XPAPPAX DPRLX(N)(K) 



(SEQ ID NO: 30) 



(SEQ ID NO: 31) 
fSEQ ID NO: 321 



Computer-assisted analysis revealed these amino acid sequences to be novel. 
Because all three sequences were the same, it was believed the 30 kDa. 28 kDa and 
18-22 kDa proteins were related and might be different forms of the same novel 
protein. Futhermore, this protein(s) was a likely candidate as the natural mpl ligand 
because the activity resolved on SDS-PAGE in the same region (26,000-32,000) of a 
4-20% gel. In addition, the partially purified ligand migrated with a Mr of 17 000- 
30,000 when subjected to gel filtration chromatography using a Superose 12 
(Pharmacia) column. It is believed the different Mr forms of the ligand are a resuit of 
proteolysis or glycosylate differences or other post or pre-translatlonal 
modifications. 

As described earlier, antisense human mpl RNA abrogated 
megakaryocytopoiesis in human bone marrow cultures enriched with CD 34+ 
progenitor cells without affecting the differentiation of other hematopoietic cell 
1-neages (Methia ef al., supra). This result suggested that the mpl receptor might play 
a role in the differentiation and proliferation of megakaryocytes In vitro. To further 
elucidate the role of the mpl ligand in megakaryocytopoiesis. the effects of APP and mpl 
Lgand depleted APP on m vitro human megakaryocytopoiesis was compared. The effect 
of APP on human megakaryocytopoiesis was determined using a modification of the 
Uquid suspension megakaryocytopoiesis assay described in Example 4. In this assay 
human peripheral stem cells (PSC) were treated wtth APP before and after mpHoG 
affinity chromatography. GPII b lll a stimulation of megakaryocytopoiesis was 
quantified with an 125,. antj .„ bl „ a ant , body {Rg 7) |(j Rg ^ ^ % ^ 

caused approximately a 3-fo.ld stimulation while APP depleted of mpl ligand had no 
effect. Significantly, the mp, ligand depleted APP did not Induce proliferation of the 

Ba/F3-mp/ cells. 
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In another experiment, soluble human mpMgG added at days 0, 2 and 4 to 
cultures containing 10% APP neutralized the stimulatory effects of APP on human 
megakaryocytopoiesis (Fig. 8). These results indicate that the mpl ligand plays a 
role in regulating human megakaryocytopoiesis and therefore may be useful for the 
treatment of thrombocytopenia. 

2. Molecular Cloning of the mpl Ligand 

Based on the amino-terminal amino acid sequence obtained from the 30 kDa, 28 
kDa and 18-22 kDa proteins (see Table 2 above), two degenerate oligonucleotide 
primer pools were designed and used to amplify porcine genomic DNA by PCR. It was 
reasoned that If the amino-terminal amino acid sequence was encoded by a single exon 
then ihe correct PCR product was expected to be 69 bp long. A DNA fragment of this 
size was found and subcloned Into pGEMT. The sequences of the oligonucleotide PCR 
primers and the three clones obtained are shown In Example 5. The amino acid 
1 5 sequence (PRLLNKLLR [SEQ ID NO: 33]) of the peptide encoded between the PCR 
primers was identical to that obtained by amino-terminal protein sequencing of the 
porcine ligand (see residues 9-17 for the 28 and 30 kDa porcine protein sequences 
above). 

A synthetic oligonucleotide based on the sequence of the PCR fragment was used 
to screen a human genomic DNA library. A 45-mer oligonucleotide, designated pR45. 
was designed and synthesized based on the sequence of the PCR fragment. This 
oligonucleotide had the following sequence: 

5' GCC-GTG-AAG-GAC-GTG-GTC-GTC-ACQ-AAG-CAQ-TTT-ATT-TAG-GAQ-TCG 3« 

(SEQ ID NO: 34) 

This deoxyoligonucleotide was used to screen a human genomic DNA library in 
Xgeml2 under low stringency hybridization and wash conditions according to 
Example 6. Positive clones were picked, plaque purified and analyzed by restriction 
mapping and southern blotting. A 390 bp EcoRI-Xbal fragment that hybridized to the 
45-mer was subcloned Into pBiuescript SK-. DNA sequencing of this clone confirmed 
that DNA encoding the human homolog of the porcine mpl ligand had been isolated. The 
human DNA sequence and deduced amino acid sequence are shown in Fig. 9 (SEQ ID 
NOS: 3 & 4). The predicted positions of introns in the genomic sequence are also 
indicated by arrows, and define a putative exon ("exon 3"). 

Based on the human 'exon 3" sequence (Example 6) oligonucleotides 
corresponding to the 3" and 5' ends of the exon sequence were synthesized. These 2 
primers were used in PCR reactions employing as a template cDNA prepared from 
various human tissues. The expected size of the correct PCR product was 140 bp. 
After analysis of the PCR products on a 12% polyacrylamlde gel, a DNA fragment of the 
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ZZTJT TT " CDNA " tafiS! p " w *- — — «»* «• 

fetal kidney eel s and cDNA nnnar^ ^ i_ 

CLMW prepared from human fetal liver 

with theVam' IT ' ibrary (7X1 ° 6 ^ ' ambda ° R2 WaS - 

. r, , : CD ;;;: ii9onucieotide used ,o — - h — -«- ■»* and 

In t f 7 Und9r ' 0W S,fin9enCy hybfidlZation Positive 

clone I 6 ' PlaqUB PUri,iSd ^ ™« "» ~ *~« d » -C". One 

Tc ^r; E * insert was se,ec,ed ,or ^ ^ - p-^s 
xxTwt 0 ::r °r and deduced add * - — 

> NOS: 1 & 2) ^ SeqUenC8S ^ PreSen,6d 1 (SEQ ,D 

3. Structure of the Human mpl Llgand (hML) 
The human mpl ligand (hML) cDNA sequence (Fig. 1 [SEQ ID NO- 21) 
comprises 1774 nucieotides foilowed by a po,y(A) ta,.. „ contains 215 nuc.eot.des of 
5 untranslated sequence and a 3' untranslated region of 498 nucleotides The 
presumed initiation codon at nuc.eotide position (216-218) is within a consensus 

STEJSTT ,or eukaryotic ,rans,ation inma,ion - ^ open readina *— 18 

1059 nucieotides long and encodes a 353 amino acid residue polypeptide, beginning at 
nuc ieo d position vo. The N-,erminus of the predicted amino acid stance I 
ghiy hydrophobic and probab, corresponds to a signal peptide. Computer Irys* J 

TJTT 3min ° aCW SeqU6nCe (V0 " Hei|ne " *' Eur - J - Blo °»™-' "3:17-21 
[1983]) •nd.cates a potentia. cleavage site for signa. pepU dase between residues 21 

and 22. Cleavage at that position wou.d generate a mature polypeptide of 332 amino 

pu fed from porcine plasma. The predicted non-glycosyfcted molecular weight o^ the 
332 am.no acid residue ligand is about 38 kDa. There are 6 potential N-glycosytatlon 
sites and 4 cysteine residues. ywsywron 

Comparison of the mp, ligand sequence with the Genbank sequence database 

Zd an h De,Ween 153 - ~« Z 

and a d human erythropoietin (F.g. 10 [SEQ , D N0 S: 6 * 7]). When conservative 

ubsftutlons are taken into account, this region of hML shows 50% similarity to 

human erylhrop.ie.ln (hEPO). Both hEPO and the hML contain four cysteines. Three of 

the 4 cysteine* are conserved in hML. including the first and test cysteines. S«e- 

elt 7?T* eXPer ' mentS ^ ,hat the *« " d «« seines of 
ervthropo.etu, form a dteumde bond that is required for function (Wang. F.Fef a/ 

hnrr; 1652286 ' 2292 11983]) - By ^ - - - - 

^ alS ° ,0rm a CrWcal d,sulfid ° bond. None of the glycosylate sites are 
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conserved in hML. All potential hML N-linked m^ n *„, 

IN " nKea 9'ycosylation sites are located in the 
carboxy-terminal half of the hML polypeptide. 

Similar to hEPO, the hML mRNA does not contain the consensus polyadeny,ation 
_ sequence AAUAAA, nor the regu.atory element AUUUA that is present in 3' untrans.ated 

' 72 1" 13 "" Cy, ° kineS iS ,h0U9h, 10 in, ' Uence mRNA stabi »<y (Shawef a/., Cell. 
46.659-667 [,986]). Northern blot analysis reveals low levels of a single 1 8 kb 
hML RNA transcript in both fetal and adult liver. After longer exposure, a weaker 
band of the same size could be detected in adult kidney. By comparison, human 
erythropo,etin is expressed In fetal liver and. in response to hypoxia, the adult kfdney 
and liver (Jacobs et a,., Nature, 313:804-809 [1985J and Bondurant er a/., Mo ,ec 
Cell. Biol., 6:2731-2733 [1986]). 

The importance of the C-terminal region of the hML remains to be elucidated 
Based on the presence of the six potential sites for N-linked glycosylation and the 
ab.l.ty of the iigand to bind lectin-affinity columns, this region of the hML is likely 
glycosylated. In some gel elution experiments, we observed activity resolving with a 
M r around 60,000 which may represent the full length, glycosylated molecule The 
C-terminal region may therefore act to stabilize and increase the half-life of 
crculating hML. in the case of erythropoietin, the non-glycosylated form has full /„ 
wfro biological activity, but has a significantly reduced plasma ha.f-.ife relative to 
glycosylated erythropoietin (Takeuchi et al.. J. Biol. CAem.. 265:12127-12130 
[1990]; Narhi et a/., J. Biol. Chem., 266:23022-23026 [1991] and Spivack et al 
S/ood. 7:90-99 [1989]). The C-terminal domain of hML contains two di-basic 
ammo acid sequences [Arg-Arg motifs at positions 153-154 and 245-246] that could 
serve as potential processing sites. Cleavage at these sites may be responsible for 
generating the 30, 28 and 18-22 kDa forms of the ML isolated from APR 
Significantly, the Arg 153 -Arg 154 sequence occurs immediately following the 
erythropo.etin-.ike domain of the ML. These observations indicate that full length ML 
may represent a precursor protein that undergoes limited proteolysis to generate the 
mature Iigand. 

4. Isoforms and Variants of the Human mpl Ligand 
Isoforms or alternatively spliced forms of human mpl Iigand were detected by 
PCR in human adult liver. Briefly, primers were synthesized corresponding to each 
end as well as selected interna, regtons of the coding sequence of hML These primers 
were used In RT-PCR to amplify human adult .iver RNA as described in Example 10 
•n addition to the ful. .ength form, designated hML. three other forms, designated 
hML2, hML3 and hML4, were observed or deduced. The mature deduced amino acid 
sequences of al. four isoforms is presented in Fig. 11 (SEQ ID NOS: 6 8 9 & 10) 
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hML3 has a 116 nucleotide deletion a nncitinn 7nn . u- l. 

uetenon a position 700 which results in both an amino acid 

deletion and a frameshift. The cDNA now encodes a mature P o.ypeptide that is 265 

am,no acid iong and diverges from the hML sequence at amino acid residue 139 

Rnally, hML4 has both a 12 nucieotide deletion following nucleotide position 618 

(also found .n the mouse and the pig sequences [see below]) and the 116 bp deletion 

found In hML3. Altough no clones with only the 12 bp deletion (following nucleotide 

619) have been isolated in the human (designated hML2), this form is likely to exist 

because such a isoform has been identified in both the mouse and pig (see below), and 

because It has been identified in conjunction with the116 nucleotide deletion in hML4. 

Both a substitutional variant of hML In which the dibasic Arg 15 3-Arg 15 4 

sequence was replaced with two alanine residues and a "EPO-domain- truncated form of 

hML were constructed to determine whether the full length ML was necessary for 

b.ological activity. The Arg 153 -Arg 154 dibasic sequence substitutional variant 

refered to as hML(R153A, R154A), was constructed using PGR as described in 

Example 10. The "EPO-domain" truncated form. hML 153l was also made using PCR 

by introducing a stop codon following Arg153. 
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5. Expression of Recombinant Human mpl Llgand (rhML) in 
Transiently Transfected Human Embryonic Kidney (293) 
20 cells 

To confirm that the cloned human cDNA encoded a ligand for mpl. the ligand was 
expressed in mammalian 293 cells under the control of the cytomegalovirus immediate 
early promoter using the expression vectors pRK5-hML or pRK5-hMLi 53 . 
Supernatants from transiently transfected human embryonic kidney 293 cells were 
found to stimulate 3 H -thymidine incorporation in Ba/F3-mp, cells, but not In 
parental Ba/F3 cells (Fig. 12A). Media from the 293 cells transfected with the pRK 
vector alone did not contain this activity. Addition of mpHgQ to the media abolished 
the stimulation (data not shown). These results show that the cloned cDNA encodes a 
functional human ML (hML). 

To determine if the 'EPO-domain' alone could bind and activate mpl the 
truncated form of hML. rhML 153 . was expressed in 293 cells. Supernatants 'from 
transfected cells were found to have activity similar to that present in supernatants 
from cells expressing the full length hML (Fig. 12A). indicating that the C-terminal 
domain of ML Is not required for binding and activation of c-mpl . 
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6. mp/ Llgand Stimulates Megakaryocytopolesis and 

Thrombopoiesis 
Both the full length rhML and the truncated rhML 153 forms of recombinant 
hML stimulated human megakaryocytopoiesis in vitro (Fig. 12B). This effect was 
observed in the absence of other exogenously added hematopoietic growth factors With 
the exception of IL-3, the ML was the only hematopoietic growth factor tested that 
exhibited this activity. Il_.11, IL-6. IL-1, erythropoietin, G-CSF, IL-9, LIF, kit 
ligand (KL). M-CSF, OSM and GM-CSF had no effect on megakaryocytopolesis when 
tested separately in our assay (data not shown). This result demonstrates that the ML 
has megakaryocyte-stimulating activity, and indicates a role for ML in regulating 
megakaryocytopolesis. 

Thrombopoietic activities present in plasma of thrombocytopenic animals have 
been shown to stimulate platelet production In a mouse rebound thrombocytosis assay 
(McDonald, Proc. Soc. Exp. Biol. Med.. 14:1006-1001 [1973J and McDonald et at., 
15 Scand. J. Haematol., 16:326-334 [1976]). In this model mice are made acutely 
thrombocytopenic using specific antiplatelet serum, resulting in a predictable rebound 
thrombocytosis. Such immuno-thrombocythemic mice are more responsive to 
exogenous thrombopoietin-like activities than are normal mice (McDonald, Proc. Soc 
Exp. Biol. Med., 14:1006-1001 [1973]). just as exhypoxlc mice are more sensitive 
2 0 to erythropoietin than normal are mice (McDonald, et at., J. Lab. din. Med.. 77:134- 
143 [1971]). To determine whether the rML stimulates platelet production In vivo, 
mice in rebound thrombocytosis were injected with partially purified rhML Platelet 
counts and incorporation of 35 S into platelets were then quantitated. Injection of mice 
*ith 64,000 or 32.000 units of rML significantly increased platelet production, as 
25 evidenced by a -20% increase in platelet counts (p=0.0005 and 0.0001. 
respectively) and a -40% increase in 35 s incorporation into platelets <p=0.003) In 
the treated mice versus control mice injected with excipient alone (Fig. 12C). This 
level of stimulation is comparable to that which we have observed with | L -6 in this 
model (data not shown). Treatment with 16.000 units of rML did not significantly 
3 0 stimulate platelet production. These results indicate that ML stimulates platelet 
production in a dose-dependent manner and therefore possesses thrombopoietin-like 
activity. 

293 cells were also transfected with the other hML isoform constructs 
described above and the supernatants were assayed using the Ba/F3-mp/ proliferation 
assay (see Rg. 13). hML2 and hML3 showed no detectable activity in this assay 
however the activity of hML(R153A. R154A) was similar to hML and hMLi 53 
Indicating that processing at the Ar 91 53-^54 dl-baslc site is neither required for 
nor detrimental to activity. 
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7. Megakaryocytopolesls and the mpl Llgand 

It has been proposed that megakaryocytopoiesis is regulated at multiple 
ce„u ar evels (Wi.iiams et a,. j.Cel, Pnysio,.. 110:101 . 10 4 [1982] and Williams 
5 et a,.. Blood CeUs. 15:123-133 [1989]). This is based iargely on the observation 
that certam hematopoietic growth factors stimulate proliferation of megakaryocyte 
progenrtors while others appear to primarily affect maturation. The results presented 
here suggest that the ML acts both as a proliferative and maturation factor. That ML 
stimulates proliferation of megakaryocyte progenitors is supported by several lines of 
evidence. First. APP stimulates both proliferation and maturation of human 
megakaryocytes /„ vitro, and this stimulation is completely inhibited by mp/-lgG 
(Figs. 7 and 8). Furthermore, the inhibition of megakaryocyte colony formation by 
c-mpl ant.sense oligonucleotides (Methia et a/., Blood. 82:1395-1401 [1993]) and 
the finding that c-mpl carr transduce a proliferative signal in cells into which it is 
1 5 transacted (Skoda et al.. EMBO, 12:2645-2653 [1993] and Vigon et a,.. Oncogene 
8:2607-2615 [1993]) also indicate that ML stimulates proliferation. The apparent 
express.cn of cmpl during all stages of megakaryocyte differentiation (Methia et al 
Blood. 82:1395-1401 [1993]) and the ability of recombinant ML to rapidly 
stimulate platelet production in vivo indicate that ML also affects maturation The 
10 availability of recombinant ML makes possible a careful evaluation of its role In 
regulating megakaryocytopoiesis and thrombopoiesis as well as its potential to 
influence other hematopoietic lineages. 



25 



30 



35 



8. Isolation of the Human mpl Ligand (TPO) Gene 
Human genomic DNA clones of the TPO gene were isolated by screening a human 
genomic library in X-Gem12 with P R45. under low stringency conditions or under 
h,gh stringency conditions with a fragment corresponding to the 3' half of human cDNA 
cod.ng for the mpl ligand. Two overlapping .ambda clones spanning 35 kb were 
isolated. Two overlapping fragments (BamH1 and EcoRI) containing the entire TPO 
gene were subcloned and sequenced (see Pigs. 14A. 14B and 14C). 

The structure of the human gene is composed of 6 exons within 7 kb of genomic 
DNA. The boundaries of all exon/intron junctions are consistent with the consensus 
motif established for mammalian genes (Shapiro. M. B.. et al., Nucl. Acids Res 
15:7155 [1987]). Exon 1 and axon 2 contain 5' untranslated sequence and the initial 
four amino acids of the signal peptide. The remainder of the secretory signal and the 
f.rst 26 amino acids of the mature protein are encoded within exon 3. The entire 
carboxyl domain and 3' untranslated as well as -50 amino acids of the erythropoietin- 
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? Tr encoded wi,hin exon 6 The ,our amin ° adds inv ° |ved »• 

observed wfth.n hML-2 (hTPO-2) are encoded at the 5" end o« exon 6 

Analysis of human genomic DNA by Southern blot indicated the gene for TPO is 
present ,n a s.ngle copy. The chromosomai ioca.ion of the gene was determined by 
5 fluorescent in situ hybridization (FISH) which mapped to chromosome 3q27-28. 

9- Expression and Purification of TPO from 293 Cells 

Preperation and purification of ML or TPO from 293 cells is described in detail 
in Example 19. Briefly, cDNA corresponding to the TPO entire open reading frame 
was obtained by PGR using pRKS-hmp/ 1. The PGR product was purified and cloned 
between the restriction sites Clal and Xbal of the plasmid pRKStkneo (a pRK5 derived 
vector modified to express a neomycin resistance gene under the control of the 
thym.dine kinase promote) to obtain the vector P RK5tkneo.ORF(a vector coding for the 
entire open reading frame). 

A second vector coding for the EPO homologous domain was generated the same 
but using different PCR primers to obtain the final construct called pRK5-tkneoEPO- 
D. 

These two constructs were transfected into Human Embryonic Kidney cells by 
the CaP0 4 method and neomycin resistant clones were selected and allowed to grow to 
confluency. Expression of ML 153 or ML3 32 in the conditioned media from these 
clones was assessed using the Ba/F3-rop/ proliferation assay. 

Purification of rhML 33 2 was conducted as described in Example 19 
Br.efly, 293-rhML 33 2 conditioned media was applied to a Blue-Sepharose 
(Pharmacia) column that was subsequently washed with a buffer containing 2M urea 
The column was elated with a buffer containing 2M urea and 1M NaCI. The Blue- 
Sepharose elution pool was then directly applied to a WGA-Sepharose column, washed 
wrth 10 column volumes of buffer containing 2M urea and 1 M NaCI and eluted with the 
same buffer containing 0.5M N-acetyl-D-glucosamlne. The WGA-Sepharose eluate 
was applied to a C4-HPLC column (Synchrom, Inc.) and eluted with a discontinuous 
propanol gradient. By SOS-PAGE the purified 293-rhML 332 migrates as a broad band 
in the 68-80 kDa region of the gel (see Fig. 15). 

Purification of rhML 153 was also conducted as described in Example 19 
Briefly, 293-rhML 153 conditioned media was resolved on Blue-Sepharose as 
described tor rhML3 32 . The Blue Sepharose eluate was applied directly to a mpl- 
affinity column as described above. RhML 153 eluted from the mpAafflnity column 
was purified to homogeneity using a C4-HPLC column run under the same conditions 
used for rhML33 2 . By SDS-PAGE the purified rhML 153 resolves into 2 major and 2 
minor bands with Mr of -18,000-22,000 (see Fig. 15). 
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10. The Murine mpl Ligand 

o b t a in e l DN pr , ^ me^, C ° rreSPOnding ,0 the Codi "° »#" °< the human mp/ligand was 
o ,a,ned by PGR. ge. purified and labeied ,„ » he presence of 32 P . dATP ^ 3 ^ 

lineT ,0 SCfeen C ' 0neS " 3 — *»< ™* ■»* in *GT10. A 

murine clone Fig. 16 [SEQ ID NOS- 13 a iw 

was isolate a „w ]) conta,n,n 9 a 1443 base pair insert 

Ml w a ,H 6nCed ini,iatl '° n C0d0 " at nuc, «**> P0-*» 138- 

(KozsTm , V° nSenSUS S8qUenCe ,aVOraD,e ,0r 6Uk ^ 0,ic <™™" W«-«on 
) eaTo f " 108 ' 229 " 241 [1989)) ' Th ' S « open 

« ^ m r: e ; r nucieotides ' wh,ch ^ * «*— . 

352 am no acds. Flanking ,his open reading frame are 137 nucleotides of * and 247 
uceo des of , un.rans.ated sequence. There is no P o, y( A, «a„ foilowing the * 

7Z^J onimn9 ,hat the c,one is probab,y not The ~ 

of the predated amino acid sequence is highly hydrophobic and probabry represents a 
l9 na peptide. Computer analysis (von H eij ne. Q. Eur. J. Bio J m . 
1983 lnd , cated a potem|a| c)eavage sjte fQr sjgnai peptwase 
and 2 2 . Cleavage a , ^ pos|tion ^ ^ 

acds (35 kDa) identified as mML 331 (or mML2 for reason/ delcld bl.ow) 

r n TrN co r a,ns 1 4 . cys,e,nes ' an conserved in the human —en 

potent.al N- g ,ycos y ,at,on sites, 5 of which are conserved in the human sequence 
Aga,, as with hML, a,, seven po.entia. N-g.ycosy.a.ion sites are located in the C- 

terminal half of the protein. 

and d cTT T Pare<1 ^ ^ hUman ML COnSiderab,e *» "^tide 

and deduced amino acid sequences were observed in the -EPC-domairu, of these MLs 

However, when deduced amino acid sequences of human and mouse ML, were aligned' 
he m 0use sepuence appeared tQ ^ & ^ mU J [ 

llZr* T ^ nUC ' e0Wde de ' eti0n ,0,1 ° Win9 «»• — 

both the human (see above) and pig (see beiow, cDNA's. Accordingly, additional 

ammo acid deduced sequence peptide containing the "missing- tetrapeptide LPLQ 
™ S form is believed to be the ,ul, length murine ML and Is refered to as mML or 
-ML335. The nucleotide and deduced amino acid sequence for mML are provided I 
«g. 17 (SEQ , D NOS: 14 * „>. This cDNA Cone consists of 1443 base pairs 
o owed by a po,y(A) «ai.. I, possesses an open reading frame of 10 68 bp flanked by 

ZT 1 ' ^ " 1 baS6S " 3 ' UntranS ' ated ~ ~ 'nitiat J 
D tt , T 138 - 14 °- ^ ° Pen readinQ f ™ -codes a predicted 

protein o, 356 amino acids, the firs, 21 of which are h.gh,y hydrophobic and likeiy 

function as a secretion signal. 
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Finally, a third murine clone was isolated, sequenced and was found to contained 
the 116 nucleotide deletion corresponding to hML3. This murine isoform is therefore 
denominated mML3. Comparison of the deduced amino acid sequences of these two 
isoforms is shown in Fig. 18 (SEQ ID NOS: 9 & 16). 

The overall amino acid sequence identity between human and mouse ML (Fig. 
19 [SEQ ID NOS: 6 & 17)) is 72% but this homology is not evenly distributed. The 
region defined as the "EPO-domain" (amino acids 1-153 for the human sequence and 
1-149 for the mouse) is better conserved (86% homology) than the carboxy- 
terminal region of the protein (62% homology). This may further indicate that only 
the "EPO-domain" is important for the biological activity of the protein. 
Interestingly, of the two di-basic amino acid motifs found in hML, only the di-basic 
motif immediately following the "EPO-domain" (residue position 153-154) in the 
human sequence is present in the murine sequence. This is consistent with the 
possibility that the full length ML may represent a precursor protein that undergoes 
limited proteolysis to generate the mature ligand. Alternatively, proteolysis between 
Argi53-Argi54 may facilitate hML clearance 

An expression vector containing the entire coding sequence of mML was 
transiently transfected into 293 cells as described in Example 1. Conditioned media 
from these cells stimulated 3 H -thymidine incorporation into Ba/F3 cells expressing 
either murine or human mpl but had no effect on the parental (mp/-less) cell line. 
This indicates that the cloned murine ML cDNA encodes a functional ligand that is able 
to activate both the murine and human ML receptor (mp/). 

11. The Porcine mpl Ligand 

Porcine ML (pML) cDNA was isolated by RACE PCR as described in Example 
13. A PCR cDNA product of 1342 bp was found in kidney and subcloned. Several 
clones were sequenced and found to encode a pig mpl ligand of 332 amino acid resudues 
referred to as pML (or PML332) having the nucleotide and deduced amino acid 
sequence shown in Fig. 20 (SEQ ID NOS: 18 & 19). 

Again, a second form, designated P ML2. encoding a protein with a 4 amino acid 
residue deletion (228 amino acid residues) was identified (see Fig. 21 [SEQ ID NO: 
21J). Comparison of pML and pML2 amino acid sequences shows the latter form is 
identical except thai the tetrapeptide QLPP corresponding to residues 111-114 
inclusive have been deleted (see Fig. 22 [SEQ ID NOS: 18 & 21]). The (our amino 
3 5 acid deletions observed in both murine and porcine ML cDNA occur at precisely the 
same position within the predicted proteins. 

Comparison of the predicted amino acid sequences of the mature ML from 
human, mouse, and pig (Fig. 19 [SEQ ID NOS: 6. 17 & 18]) indicates that overall 
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, the amino-tela, ha^ J Z, ^ ^ '* ~ * — 
84 percent M.nH , . homologous domain). This domain is 80 to 

84 percent idenfca. between any two species whereas the carboxy-terminal half 

*12 T ' Pf0,eaSe ° ,eaVa9e S " e iS Presen « * the «* of m. 

thH . '° 9y d ° main - Th,S ^ iS beh ™ - species a 

h pos.on (Fig. 19 [SEQ ,D NOS: 6, 17 * 1 8] ). A second di-basic site p esen ■ 

P-non 245 and 246 in the human sequence te no, present in the mouse or plg 

sequences. The murine and the pig ML sequence contain 4 cysteines, all conserved in 

he human sequence. There are seven potential N-giycosylation sites within the mouse 

gand and six within the porcine ML, 5 of which are conserved within the human 

h™ Zl " ,he potential N - glycosyla,ion si,es are ,oca,ed ,n - — 

The expression vectors used to transfect cwn ^.n. . 

TP0 1S3) . The pertment features of these plasmlds are presented in Fig. 23 and 24 

The transfectlon procedures are described In Examp.e 20. Briefly cDNA 
correspond^ to the entire open reading frame 0, TPO was obtained by PCR The P^R 
Product was purified and cloned between two restriction sites (C.a. a,d Sa,!> ofThe 
P.asm,d pSVI5.ID.LL to obtain the vector P SV, 5 , D . LL . M LORF. A second cLTc 
corresponding ,0 »e EPO homologous dorna* was generated » way J^J 

EPO 7" Primer(EPOD Sal >- *- -"struct for the vector clg for I 
EPO homologous domain of TPO is called pSVI5.ID.LL.ML£PO-D 

These two constructs were linearized with Notl and tmnsfected into Chinese 
electr T ^7 EP WJM « 15 March ^ 

Z voT^i 1° ce " 8 were e,ec,roporated in a brl ~ 

350 Vote 330 mF. .ow capacitance) ,n the presence of 10. 25 or 50 mg of DNA as 
escribed (Andreason. G.L. J. T,ssue Cu,, Meth . 1 5(56 ,1993]). The cLy ,2,2 
ransfect.cn. ce„s were split in DHFR selective media (High g,ucose 0MEM-F12 
0:50 withou, glycine. 2mM glutamine. 2-5% dtely26 d feta, J serum, 10 to 

coir '"f idUa, I COl0 " i — *-"«- «o * we» plates and a«ow j ,0 grl to 
confluency. Expression of M L 153 or ML332 In the conditioned media from these 
was assessed using the Ba/Fa-mp/ operation assay (descried to E» m p.. .) 
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The process for purifying and isolating TPO from harvested CHO cell culture 
fluid Is described in Example 20. Briefly, harvested cell culture fluid (HCCF) is 
applied to a Blue Sepharose column (Phamacia) at a ratio of approximately 100L of 
HCCF per liter of resin. The column is then washed with 3 to 5 column volumes of 
5 buffer followed by 3 to 5 column volumes of a buffer containing 2.0M urea. TPO is 
then eluted with 3 to 5 column volumes of buffer containing both 2.0M urea and 1 OM 
NaCI. 

The Blue Sepharose eluate pool containing TPO Is then applied to a Wheat Germ 
Lectin Sepharose column (Pharmacia) equilibrated in the Blue Sepharose eluting 
1 0 buffer at a ratio of from 8 to 16 ml of Blue Sepharose eluate per ml of resin. The 
column is then washed with 2 to 3 column volumes of equilibration buffer. TPO is then 
eluted with 2 to 5 column volumes of a buffer containing 2.0M urea and 0.5M N- 
acetyl-D-glucosamine. 

The Wheat Germ Lectin eluate containing TPO is then acidified and C12E8 is 
1 5 added to a final concentration of 0.04%. The resulting pool is applied to a C4 reversed 
phase column equilibrated in 0.1% TFA, 0.04% C 12 E 8 at a load of approximately 0.2 
to 0.5 mg protein per ml of resin. 

The protein is eluted in a two phase linear gradient of acetonitrile containing 
0.1% TFA and 0.04% C12E8 and a pool is made on the basis of SDS-PAGE. 

The C4 Pool is then diluted and diafilitered versus approximately 6 volumes of 
buffer on an Amlcon YM or like ultrafiltration membrane having a 10,000 to 30,000 
Dalton molecular weight cut-off. The resulting diafiltrate may be then directly 
processed or further concentrated by ultrafiltration. The diafiltrate/concentrate is 
usually adjusted to a final concentration of 0.01% Tween-80. 

All or a portion of the diafiltrate/concentrate equivalent to 2 to 5% of the 
calculated column volume is then applied to a Sephacryl S-300 HR column 
(Pharmacia) equilibrated In a buffer containing 0.01% Tween-80 and 
chromatographed. The TPO containing fractions which are free of aggregate and 
proteolytic degradation products are then pooled on the basis of SOS-PAGE. The 
3 0 resulting pool is filtered and stored at 2-8°C. 

13. Methods for Transforming and Inducing TPO Synthesis in a 
Microorganism and Isolating, Purifying and Refolding TPO 
Made Therein 

35 Construction of £ coli TPO expression vectors Is described in detail in 

Example 21. Briefly, plasmids pMP21, pMP151. pMP41, pMP57 and pMP202 
were all designed to express the first 155 amino acids of TPO downstream of a small 
leader which varies among the different constructs. The leaders provide primarily for 
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high level translation initiation and rapid purification. The plasmids pMP210-1, 
-T8, -21, -22, -24, -25 are designed to express the tirst 153 amino acids of TPO 
downstream of an initiation methionine and differ only In the codon usage for the first 
6 amino acids of TPO, while the plasmid pMP251 is a derivative of P MP21 0 .1 in 
5 which the carboxy-terminal end of TPO is extended by two amino acids. All of the 
above plasmids will produce high levels of intracellular expression of TPO in E. coli 
upon induction of the tryptophan promoter (Yansura, D. G. et. al. Methods in 
Enzymology ( Goeddel, D. V., Ed.) 185:54-60, Academic Press, San Diego [1990]) 
The plasmids P MP1 and P MP172 are intermediates in the construction of the above 
1 0 TPO intracellular expression plasmids. 

The above TPO expression plasmids were used to transform the £ coli using the 
CaCI 2 heat shock method (Mandel, M. et al. J. Mol. Biol., 53:159-162, [1970]) and 
other procedures described in Example 21. Briefly, the transformed cells were 
grown first at 37°C until the optical density (600nm) of the culture reached 
approximately 2-3. The culture was then diluted and, after growth with aeration, acid 
was added. The culture was then allowed to continue growing with aeration for another 
15 hours after which time the cells were harvested by conjugation. 

The Isolation, Purification and Refolding procedures given below for production 
of biologically active, refolded human TPO or fragments thereof is described in 
Examples 22 and 23 can be applied for the recovery of any TPO variant Including N 
and C terminal extended forms. Other procedures suitable for refolding recombinant 
or synthetic TPO can be found in the following patents; Builder et al., U.S Patent 
4.511,502; Jones et al., U.S. Patent 4,512,922; Olson U.S. Patent 4,518,526 and 
Builder et al., U.S. Patent 4.620,948; for a general description of the recovery and 
refolding process for a variety of recombinant proteins expressed in an insoluble form 
in E. coli. 

A Recovery of non-soluble TPO 

A microorganism such as E. coll expressing TPO encoded by any suitable 
plasmid is fermented under conditions In which TPO is deposited in insoluble 
"retractile bodies'. Optionally, cells are first washed in a cell disruption buffer 
Typically, about 100g of cells are resuspended in about 10 volumes of a cell 
disruption buffer (e.g. 10 mM Tris. 5 mM EDTA, pH 8) with, for example, a Polytron 
homogenlzer and the cells centrifuged at 5000 x g for 30 minutes. Cells are then lysed 
using any conventional technique such as tonic shock, sonication, pressure cycling, 
chemical or enzymatic methods. For example, the washed cell pellet above may be 
resuspended in another 10 volumes of a cell disruption buffer with a homogenlzer and 
the cell suspension Is passed through an LH Cell Disrupter (LH Inceltech. Inc) or 
through a Mlcrofluidizer (Microfluidics International) according to the manufactures' 
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instructions. The particulate matter containing TPO is then separated from the liquid 
phase and optionally washed with any suitable liquid. For exampie. a suspension of ceil 
lysate may be centhfuged at 5,000 X g for 30 minutes, resuspended and optionally 
centnfuged a second time to make a washed refractile body pellet. The washed pellet 
may be used immediately or optionally stored frozen (at e.g. -70»C). 
B. Solubilization and Purification of Monomeric TPO 
Insoluble TPO In the refractile body pellet is then solubilized with a solublizlng 
buffer. The solublizing buffer contains a chaotropic agent and is usually buffered at a 
basic pH and contains a reducing agent to improve the yield of monomeric TPO. 
Representative chaotropic agents include urea, guanidine-HCI. and sodium thiocyanate. 
A preferred chaotropic agent is guanidine-HCI. The concentration of chaotropic agent is 
usually 4-9M, preferably 6-6M. The pH of the solublizing buffer is maintained by 
any suitable buffer in a pH range of from about 7.5-9.5, preferably 8.0-9.0 and most 
preferably 8.0. Preferably the solubllizing buffer also contains a reducing agent to 
aid formation of the monomeric form of TPO. Suitable reducing agents include organic 
compounds containing a free thiol (RSH). Representative reducing agents include 
dithiothreitol (DTT), dithioerythritol (DTE), mercaptoethanol, glutathione (GSH) 
cysteamine and cysteine. A preferred reducing agent is dithiothreitol (DTT).' 
Optionally, the solubilizing buffer may contain a mild oxidizing agent (e.g. molecular 
oxygen) and a sulfite salt to form monomeric TPO via sulfitolysia. in this embodiment 
the resulting TPO-S-sulfonate is later refolded in the presence of the redox buffer 
(e.g. GSH/GSSG) to form the properly folded TPO. 

The TPO protein is usually further purified using, for example, centrifugation, 
gel filtration chromatography and reversed phase column chromatography. 

By way of Illustration, the following procedure has produced suitable yields of 
monomeric TPO. The refractile body pellet is resuspended in about 5 volumes by 
weight of the solubilizing buffer (20 mM Tris, pH 8, with 6-8 M guanidine and 25 
mM DTT) and stirred for 1-3 hr., or overnight, at 4'C to effect solubilization of the 
TPO protein. High concentrations of urea (6-8M) are also useful but generally result 
3 0 in somewhat lower yields compared to guanidine. After solubilization, the solution is 
centrtfuged at 30.000 x g for 30 min. to produce a clear supernatant containing 
denatured, monomeric TPO protein. The supernatant is then chromatographed on a 
Superdex 200 gel filtration column (Pharmacia. 2.6 x 60 cm) at a flow rate of 2 
ml/min. and the protein eluted with 20 mM Na phosphate. pH 6.0, with 10 mM DTT. 
Fractions containing monomeric. denatured TPO protein eluting between 160 and 200 
ml are pooled. The TPO protein is further purified on a semi-preparative C4 reversed 
phase column (2 x 20 cm VYDAC). The sample is applied at 5 ml/min. to a column 
equilibrated in 0.1% TFA (trlfluoroacetic acid) with 30% acetonltrile. The protein is 
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eluted with a linear gradient of acetonitrile (30-60% in 60 min.). The purified 
reduced protein elutes at approximately 50% acetonitrile. This material is used for 
refolding to obtain biologically active TPO variant. 

C Refolding TPO to Generate the Biologically Active Form 
5 Following solubilization and further purification of TPO. the biologically active 

form is obtained by refolding the denatured monomeric TPO in a redox buffer. Because 
of the high potency of TPO (half maximal stimulation in the Ba/F3 assay is achieved at 
approximately 3 pg/ml), it Is possible to obtain biologically active material utilizing 
many different buffer, detergent and redox conditions. However, under most conditions 
1 0 only a small amount of properly folded material (<10%) is obtained. For commercial 
manufacturing processes, it is desirable to have refolding yields at least 10%, more 
preferably 30-50% and most preferably >50%. Many different detergents including 
Triton X-100, dodecyl-beta-maltoside, CHAPS, CHAPSO, SDS, sarkosyl, Tween 20 and 
Tween 80. Zwittergent 3-14 and others were found suitable for producing at least 

1 5 some properly folded material. Of these however, the most preferred detergents were 

those of the CHAPS family (CHAPS and CHAPSO) which were found to work best In the 
refolding reaction and to limit protein aggregation and improper disulfide formation. 
Levels of CHAPS greater than about 1% were most preferred. Sodium chloride was 
required for the best yields, with the optimal levels between 0.1 M and 0.5M. The 
20 presence of EDTA (1-5 mM) in the redox buffer was preferred to limit the amount of 
metal-catalyzed oxidation (and aggregation) which was observed with some 
preparations. Glycerol concentrations of greater than 15% produced the optimal 
refolding conditions. For maximum yields, it was essential to have a redox pair in the 
redox buffer consisting of both an oxidized and reduced organic thiol (RSH). Suitable 

2 5 redox pairs include mercaptoethanol. glutathione (GSH), cysteamlne, cysteine and 

their corresponding oxidized forms. Preferred redox pairs were 
glutathione(GSH):oxldized glutathione(GSSG) or cys»eine:cystine. The most preferred 
redox pair was glutathione(GSH):oxidized glutathione(GSSG). Generally higher yields 
were observed when the mole ratio of oxidized member of the redox pair was equal to 

3 0 or in excess over the reduced member of the redox pair. pH values between 7.5 and 

about 9 were optimal for refolding of these TPO variants. Organic solvents (e.g. 
ethanol, acetonitrile, methanol) were tolerated at concentrations of 10-15% or lower. 
Higher levels of organic solvents increased the amount of improperly folded forms. 
Tris and phosphate buffers were generally useful. Incubation at 4 "C also produced 
3 5 higher levels of property folded TPO. 

Refolding yields of 40-60% (based on the amount of reduced and denatured TPO 
used in the refolding reaction) are typical for preparations of TPO that have been 
purified through the first C4 step. Active material can be obtained when less pure 
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preparat.ons (e.g. directly after the Superdex 200 column or after the initial 
refract.le body extraction, although the yields are .ess due to extensive precipitation 
and interference of non-TPO proteins during the TPO refolding process. 

Since TPO contains 4 cysteine residues, it is possible to generate three 
different disulfide versions of this protein: 

version 1: disulfides between cysleine residues 1-4 and 2-3 
version 2: disulfides between cysteine residues 1-2 and 3-4 
version 3: disulfides between cysteine residues 1-3 and 2-4. 
During the initial exploration in determining refolding conditions, several 
different peaks containing the TPO protein were separated by C4 reversed phase 
chromatography. Only one of these peaks had significant biological activity as 
determined using the Ba/F3 assay. Subsequently, the refolding conditions were 
opt.rn.zed to yield preferentially that version. Under these conditions, the misfolded 
versions were less than 10-20% of the total monomeric TPO obtained from the 

1 5 solubilizing step. 

The disulfide pattern for the biologically active TPO has been determined to be 
1-4 and 2-3 by mass spectrometry and protein sequencing, where the cysteines are 
numbered sequentially from the amino-terminus. This cysteine cross-linking pattern 
« consistent with the known disulfide bonding pattern of the related molecule 

2 0 erythropoietin. 

D. Biological Activity of Recombinant, Refolded TPO 
Refolded and purified TPO has activity in both /„ vitro and in vivo assays For 
example, in the Ba/F3 assay, half-maximal stimulation of thymidine incorporation 
into the Ba/F3 ce..s for TPO (itat-1 1-153) was achieved at 3.3 pg /ml (0.3 pM) 

2 5 In the mpl receptor-based ELISA. half-maximal activity occurred at 1.9 ng/ml (120 
PM). In normal and myelosuppressed animals produced by near-lethal X-radiation 
refolded TPO (MeH 1-153) was highly potent (activity was seen at doses as low as' 
30 ng/mouse) to stimulate the production of new platelets. Similar biological activity 
was observed for other forms of TPO refolded in accordance with the above described 

30 procedures (see Figs. 25, 26 and 28). 

14. Methods for Measurement of Thrombopoletlc Activity 

Thrombopoletic activity may be measured in various assays including the 
Ba/F3 mpl ligand assay described in Example 1. an In vivo mouse platelet rebound 
synthesis assay, induction of platelet cell surface antigen assay as measured by an 
antiplatelet Immunoassay (antl-GPII b l|| a) for a human leukemia megakaryoblastlc 
cell line (CMK) (see Sato et al., Brit. J. Heamatol., 72:184-190 [1989])(see also 
the liquid suspension megakaryocytopolesis assay described in Example 4), and 
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aw. 72(1) ; 9 p eo ;; e ~ ,as ; ic c (DAM,) (see osura et * 
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15. /n wvo Biological Response of n«™-i ^ - 

8 "»*ponse or Normal and Sublethals 
Irradiated Mice Treated with TPO 

Both normal and sublethally irradiated mice were treat** » 

«*,-*. Udnv <2tt> Boffi forms „ T Z Jl^l f" 

(a) £. co/^hTPO(Met- 1 ,153) 

The -Mef form of the EPO domain (Met in the -1 position plus the firs, 153 
residues of human TPO) produced in , ^ Example 23) £™ 
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normal female C57 B6 mice as described in the legends to Figs. 25A. 25 B and 25C 
These figures show that the non-glycosylated truncated form of TPO produced in £ coli 
and refolded as described above is capable of stimulating about a two-fold increase In 
platelet production in normal mice with out effecting the red or white blood cell 
population. 

This same molecule injected daily into sublethally irradiated (137 Cs } fema|e 
C57 B6 mice as described in the legends to Figs. 26A. 26B and 26C stimulated 
platelet recovery and diminished nadir but had no effect on erythrocytes or leukocytes. 

( b ) CHO-rhTP0332 

The full length form of TPO produced in CHO and injected daily into normal 
female C57 B6 mice as described in the legends to Figs. 27A, 27B and 27C produced 
about a five-fold increase in platelet production in normal mice with out effecting the 
erythrocyte or leukocyte population. 

(c) CHO-rhTP0 33 2; E. co//ThTPO(Mef 1 .153): 293-rhTP0 3 3 2 ; and E. 
5 co//-rhTPOi55 

Dose response curves were constructed for treatment of normal mice with 
rhTPO from various cell lines (CHO-rhTP0 3 32; B. co//-rhTPO (M ef 1 153); 293- 
rhTP0 332 ; and E. co/Z-rhTPOiss) as described in the legend to Fig. 28.' This figure 
shows that all tested forms of the molecule stimulate platelet production, however the 
0 full length form produced in CHO has the greatest in vivo activity. 

( d ) CHO-rhTPOi53. CHO-rhTPO» C |jp P ed- and CHO-rhTP0 332 

Dose response curves were also constructed for treatment of normal mice with 
vanous forms of rhTPO produced in CHO (CHO-rhTP0 153 , CHO-rhTPO-dipped" and 
CHO-rhTP0 332 ) as described in the legend to Fig. 29. This figure shows that all 
5 tested CHO forms of the molecule stimulate platelet production, but that the full length 
70 Kda form has the greatest in vivo activity. 



16. General Recombinant Preparation of mpl Ligand and 
Variants 

Preferably mpl ligand is prepared by standard recombinant procedures which 
involve production of the mpl ligand polypeptide by culturing cells transfected to 
express mpl ligand nucleic acid (typically by transforming the cells with an 
expression vector) and recovering the polypeptide from the cells. However it is 
optionally envisioned that the mpl ligand may be produced by homologous 
recombination, or with recombinanl production methods utilizing control elements 
introduced into cells already containing DNA encoding the mpl ligand. For example a 
powerful promoter/enhancer element, a suppressor, or an exogenous transcription 
modulatory element may be inserted In the genome of the intended host cell in 
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proximity and orientation sufficient to influent *u * 

— is c»: j; os °r — " w - *■* 

th. «,, a , ■ ° e " 9enome - 0ne next *»«ni 'or cells maklna 

the receptor polypeptide of this invention, or for increased or decreased LeL o 

5 expression, as desired. 

insertJ into 7 inVen,i ° n C ° ntemP ' a,eS 3 ** Pf ° dUCin9 ,i9a " d -"P"** 

ant L 96n0me °' 3 C ° n,ainin9 ^ mP ' li9and " UCle ' C «* "»■•«-. a 

ra r, ptl modu)atory ^ ^ ^ 

nuc.e,c ac,d molecuie to Influence transcription thereof, with an optional further step 
compnsmg cutturing the ce.. containing the transcription moratory element and the 
nude* acid molecule. The invention a.so contemplates a host cel. containing the 
md.genous mpl „ga nd nuCeic acid molecule operably linked to exogenous control 
sequences recognized by the host cell. 

A. Isolation of DNA Encoding mpl ligand Polypeptide 

The DNA encoding mpl ligand po.ypeptide may be obtained from any cDNA 
brary prepared from tissue believed to possess the mpl ligand mRNA and to express 
*a detects ,evel. The mpl .igand gene may a.so be obtaJned from a genomic DNA 
-brary or by „ vltro oIigonucleotjde ^ ^ 

acid sequence. ,u 
Libraries are screened with probes designed to identify the gene of interest or 
the prote,n encoded by it. For cDNA expression .ibrar.es. suitab.e probes indude 

Zd T' ° r n P W ? IOnal ant ' b0dieS reC09nbe ^ «*- * «»* mp/ 

gand For cDNA ,-braries suitab.e probes inc.ude oHgonuc.eot.des of about 20 -80 

bases ,n .ength that encode known or suspected portions of the mpl „gand cDNA from 

he same or d.fferent spec.es; and/or comp.ementary or homologous cDNAs or 

ragments thereof that encode the same or a s,m,.ar gene. Approbate 

screening genomic DNA libraries indude, ou, are no, iimited to . 

nSTT thereo ' ** encode * e same or 8 8imi,ar 9 ^ »— «^ 

seZd ^ " 9m6ntS ^ SCfeenin9 CDNA ° r 9 «™ te ■«* ™- 
. «ed probe may be conducted using standard procedures as descrtbed in Chapters 
10-12 of Sambrook et a/., supra. 

m-JT T mean8 t0 iS0 ' a,e enCOdin9 mp ' " 9and ls * - PCR 

methodology as described in sectfon 14 of Sambrook et a,., supra This method 

.3^17,° °! ° ,i90nUC,e ° ,ide PTObeS *- *»' ,o DNA encod.ng the ffl p, 

ligand. Strategies for selection of oligonucleotides are described below 

A preferred method of practicing this invention is to use carefully seiected 
ol-gonucleotJde sequences to screen cDNA iibrar.es from various tfcsues, preferably 
human or porcine kidney (adu.« or feta.) or ,iver eel ,™, For example, human feta. 
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liver cell line cDNA libraries are screened with the oligonucleotide probes 
Alternatively, human genomic libraries may be screened with the o.igonucleotide 
probes. 

The oligonucleotide sequences selected as probes should be of sufficient length 
and sufficiently unambiguous that false positives are minimized. The actual nucleotide 
sequence(s) is usually designed based on regions of the mpl ligand which have the least 
codon redundancy. The oligonucleotides may be degenerate at one or more positions. 
The use of degenerate oligonucleotides is of particular importance where a library is 
screened from a species in which preferential codon usage is not known. 

The oligonucleotide must be labeled such that it can be detected upon 
hybridization to DNA In the library being screened. The preferred method of labeling is 
to use ATP (e.g.. r*2p) and polynucleotide kinase to radiolabel the 5" end of the 
oligonucleotide. However, other methods may be used to label the oligonucleotide, 
including, but not limited to, biotinylation or enzyme labeling. 

Of particular interest is the mpl ligand nucleic acid that encodes a full-length 
mpl ligand polypeptide. In some preferred embodiments, the nucleic acid sequence 
includes the native mpl ligand signal sequence. Nucleic acid having all the protein 
coding sequence is obtained by screening selected cDNA or genomic libraries using the 
deduced amino acid sequence. 
2 0 ft Amino Acid Sequence Variants of Native mpl ligand 

Amino acid sequence variants of mpl ligand are prepared by introducing 
appropriate nucleotide changes into the mpl ligand DNA, or by in vitro synthesis of the 
-desired mpl ligand polypeptide. Such variants Include, for example, deletions from, or 
insertions or substitutions of. residues within the amino acid sequence for the porcine 
mpl ligand. For example, carboxy terminus portions of the mature full length mpl 
ligand may be removed by proteolytic cleavage, either in vivo or In vitro, or by 
cloning and expressing a fragment or the DNA encoding full length mpl ligand to 
produce a biologically active variant. Any combination of deletion, insertion, and 
substitution Is made to arrive at the final construct, provided that the final construct 
possesses the desired biological activity. The amino acid changes also may after post- 
translatJonal processes of the mpl ligand. such as changing the number or position of 
glycosyfation sites. For the design of amino acid sequence variants of the mpl ligand, 
the location of the mutation site and the nature of the mutation will depend on the mpl 
ligand characteristics) to be modified. The sites for mutation can be modified 
individually or in series, e.g., by (1) substituting first wfth conservative amino acid 
choices and then with more radical selections depending upon the results achieved. (2) 
deleting the target residue, or (3) inserting residues of the same or a different class 
adjacent to the located site, or combinations of options 1-3. 
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A useful method for identifieatin« ^ - • 

<+« p., yMP1M . lha , ara : -7 - - - 

«annl„ 9 mu , a9enesls - as ' " l0Ca "°" ! ™«»«™.'» ' s =a,l.d -alanm. 

neoua, o, negaUvety charoed^ ** " """" * "* * • 

man .„ , e „„ M by int , Mu0 ^ ,„„„„ „ ^ » » * 

* * .canning o, «*,, moaosnesis „ „ MUC , M a , 3 

1 s ™ "* " -~ " "» «~ «— 0. ..." 

The,. a„ «o principal vartaMaa ,„ „. «,„«,«,„ 0 , „„„ , aa 
« , ocata „ , na mulafa M ^ ^ 

to arrive at an allele or a variant not found in nature. In general th. i„, a r 

:r _ .„ ^ upo „ me mp , cirr^r 

Amino ac« s „ ue nca daltipn. w«*> m*. >.,»„,„ mUlM , 

irr s " e,, °" 5 ,or " ■* ,sma -» « ^ -she 

wtoarMmm. Hmn. domain. Amino add ,„ u ^ „.,.„„„, J , 

.n. o, m „, , m , nM . rnwu , r t.:™Lr 

^ - — . compnaa « „ „ ore ^ es ;'n : M mt 

:r:~rr:'„rrr:r:rx,rr d ^ 
^'T 0 * wen,,> 10 "• human *" a - «— . mTmirr; 

win b. m„„ „ kaly » modll> W0|((g|ca| aalB|ty m ^ ^ » * 
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significant*. The number of conserve deletions wil, be selected so as to preserve 
th. frt.ry structure of mp, ,igands in the affected domajn 

alpha helix. 

Amino acid sequence insertions include amino- and/or carboxyl-terminal 
fus,ons ranging in length from one residue to polypeptides containing a hundred or 
more residues, as well as intrasequence insertions of single or multiple amino acid 
residues. Intrasequence insertions (i.e.. Insertions within the mature mpl ligand 
sequence) may range generally from about 1 to 10 residues, more preferably 1 to 5 
most preferably 1 to 3. An exemplary preferred fusion is that of mpl ligand or 
0 fragment thereof and another cytokine or fragment thereof. Examples of terminal 
.naertions include mature mpl ligand with an N-terminal methionyl residue an 
artifact of the direct expression of mature mpl ligand In recombinant cell culture and 
fus,on of a heterologous N-terminal signal sequence to the N-terminus of the mature 
mpl ligand molecule to facilitate the secretion of mature mpl ligand from recombinant 
hosts. Such signal sequences generally will be obtained from, and thus homologous to 
the .ntended host cell species. Suitable sequences include STII or Ipp for E. coll alpha 
factor for yeast, and viral signals such as herpes gD for mammalian cells. 

Other Insertlonal variants of the mpl ligand molecule Include the fusion to the 
N- or C-termlnus of mpl ligand of immunogenic polypeptides (/..., not endogenous to 
the host to which the fusion is administered), e.g., bacterial polypeptides such as beta- 
lactamase or an enzyme encoded by the E coll Up .ocus, or yeast protein, and C- 
terminal fusions with proteins having a long halMife such as lmmunog.obul.n 
constant regions (or other Immunoglobulin regions), albumin, or ferritin as 
described in WO 89/02922 published 6 April 1989. 

A third group of variants are amino acid substitution variants. These variants 
have at least one amino acid residue in the mpl ligand molecule removed and a different 
residue Inserted In its place. The sites of greatest interest for substitutional 
mutagenesis include sites identified as the active site(s) of mpl ligand and sites where 
the amino acids found in other analogues are substantially different In terms of side- 
chain bulk, charge, or hydrophobic^, but where there Is also a high degree of 
sequence Identity at the selected site among various mpl ligand species and/or within 
the various animal analogues of one mpl ligand member. 

Other sites of interest are those in which particular residues of the mpl ligand 
obtained from various family members and/or animal species within one member are 
.dentical. These sites, especially those falling within a sequence of at least three other 
identically conserved sites, are substituted in a relatively conservative manner. Such 
conservative substitutions are shown in Table 3 under the heading of preferred 
substitutions. If such substitutions result in a change in biological activity, then more 
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Original 
fi££idue 


TABLE 3 

Exemplary 
Substitution? 


Ala (A) 


Val; Leu; lie 


Arg (R) 


Lys; Gin; Asn 


Asn (N) 


Gin; His; Lys; Arg 


Asp (D) 


Glu 


Cys (C) 


Ser 


Gin (Q) 


Asn 


Qlu (E) 


Asp 


Gly (G) 


Pro 


His (H) 


Asn; Gin; Lys; Arg 


lie (I) 


Leu; Val; Met; Ala; 




norleucine 



25 Ser (S) 



Preferred 
Substituting 



Thr (T) ser 

Tr P (W) T yr 

T V (V) Trp; Phe; Thr; Ser 

Val < v ) He; Leu; Met; Phe; 



Val 
Lys 
Gin 
Gfu 
Ser 
Asn 
Asp 
Pro 
Arg 

Leu 



Leu < L ) norleucine; lie; Val; 

Met; Ala; Phe n e 
(K) Arg; Gin; Asn 

Met ( M ) Leu; Phe; He 

Phe (F) Leu; Val; Me; Ala 

p ro (P) Qly 

Thr 



Arg 
Leu 
Leu 
Gly 
Thr 
Ser 
Tyr 
Phe 



Ala; norleucine i_ eu 



Sttttnbl meditation, in fune,i„n „ Immunol,^, ,d„ % „ ,„„ m „ 
£«« « eut * B reslaues „, ^ |mo 6aMfl J- 
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(1) hydrophobic: norleucine, Met, Ala, Val, Leu, lie- 

(2) neutral hydrophilic: Cys, Ser, Thr 

(3) acidic: Asp, Glu; 

(4) basic: Asn, Gin, His, Lys, Arg; 

5 (5) residues that influence chain orientation: G.y, Pro: and 
(6) aromatic: Trp, Tyr, Phe. 

c. asse ,T n " COnSerVa,iVe SUbStitU,i ° nS Wi " en,a " a ™ mb " °< one of these 

classes for another. Such substituted residues also may be introduced into the 

io -n g(n i: 

orot aas r T emb0dimen ' °' inVen,i ° n ' " iS deSirab,e 10 or more 

pro ea e cieavage sites that are present in the molecule. These sites are identified by 
<ns ec ,on of the encoded amino acid sequence, in the case o, trypsin, for an 

rr.'T residue - when pr ° teasa c,eava9e * 

no h T l ° Pr0te ° ,ytiC C ' eaVa " ^ SUbSt,,U,,n9 the *»~ 
an*her res.due, preferably a basic residue such as famine or a hydrophobic 

res.due such as serine; by deleting the residue: or by inserting a pro y, res'l 
immediately after the residue. 

in another embodiment, any meth.ony, residues other than the starting 

re LI T C 7 ? °' "* ^ — thre 

dues N- or Cermina. to each such methionyi residue, is substituted by another 
e, e (pr e f erab.y in accordance with Tab.e 3, or de.e,ed. Aftematrve,, abou - 
3 residues are inserted adjacent to such sites. 

the J7 a 77* feS,dUeS ^ inV ° ,Ved " m8intainin9 the Pr °^ conte ^n of 

Ibl h ,° ^ " nerally With °*™> to *• «*-*. 

» ,.ity of the mo.ecu,e and prevent aberrant crossing. ,t has been found tha, the 

. and forth cyste.es in the epo domain, numbered from the amino-terminus. « 

127, 11 ™ 9 Pr ° Per COn, °' n,ati0n ^ "* ,he " *W are not 

Accord,ng,. the second and third cysteines in the epo domain may be substituted 

oreoaJr C T ^ *" - "»< «Q- are 

ZZ1 ^ " t TheM ™ th °* «« « no. 

m «ed to, Nation from a natura, source (in the case of nature,, occurn-ng amino 

ac,d sequence vanants) or preparation by oiigonucieotide^ediated (or site-directed, 

:ir 8i8, pcr muta9enesis ' and mute9 ™" - « — *£2 

variant or a non-variant version of mpl ligand polypeptide 

Ollgonucleotide-mediated mutagenesis is a preferred method for preparing 
ubst.tu.on. deletion, and insertion variants of mp , „ ga „d DNA. This technic, e s 2 
known ,n the art as descrtbed by Ade.man ef a/.. DNA, 2:183 ,983,. B Iy 
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ligand DNA is altered bv hvbridirinn *~ r 

c„„ B ,„, n " - - - • »— - 

complementary to th. ♦ nucleot.des that are completely 

1 0 muta ion This e T °" ^ " C °<" n S 'or th 

uranon. This ensures that the oligonucleotide will hvhriHi,. 

Xcatf. Sc/. USA 75:5765 [1978] ' ^ ^ 

Alternatively, single-stranded DNA temolato mo „ 

— — > oha jn^rj^r - * am,,u * 9 

For alteration of the native DNA sannan^ 
» —s, tor example). the -^^l^T^^^ T""" 
temp.ate under suitab.e hybridization conditions A DNA d ' ^ 

usually the Klenow fragment of DNA oolvml , , Po^erlzlng enzyme, 

commentary strand of 1 t , °° l ° Synthes,ze *. 

5 the mutated torn, of th. mol »„..*, 01 DNA ,n,!<,d ~ 
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T e met od described immediacy above may be modified such that a 

ZZT TV Cr6a,ed Wh6rein b ° ,h S,randS - P*"* -tain the 
mutat, n(s . The mediations are as follows: The sing,e-stranded oligonucleotide is 
_ anneaied to the single-stranded template as described above. A mixture of three 

> deoxynbonucleotides, deoxyriboadenosine (dATP), deoxyriboguanosine (dGTP) and 
deoxyribothymidine (dTTP), is combined with a modified thio-deoxyribocytosine 
called dCTP-(aS) (which can be obtained from the Amersham Corporation). This 
m.xture is added to the template-oligonucleotide complex. Upon addition of DNA 
polymerase to this mixture, a strand of DNA identical to the template, except for the 

> mutated bases is generated. In addition, this new strand of DNA will contain dCTP-(aS) 
instead of dCTP. which serves to protect it Irom restriction endonuclease digestion 

After the template strand of the double-stranded heteroduplex is nicked with an 
appropriate restriction enzyme, the template strand can be digested with ExolU 
nuclease or another appropriate nuclease past the region that contains the site(s) to be 
mutagenic. The reaction is then stopped to leave a molecule that is only partially 
single-stranded. A complete doubie-stranded DNA homoduplex is then formed using 
DNA polymerase in the presence of all four deoxyribonucleotide triphosphates ATP 
and DNA ligase. This homoduplex molecule can then be transformed into a suitable host" 
cell such as E. coli JM101, as described above. 

DNA encoding mpl ligand mutants with more than one amino acid to be 
substituted may be generated in one of several ways, .f the amino acids are located 
close together in the polypeptide chain, they may be mutated simultaneously using one 
oilgonucieotide that codes for all of the desired amino acid substitutions. If, however 
the amino acids are located some distance from each other (separated by more than 
about ten amino acids), it is more difficult to generate a single oligonucleotide that 
encodes all of the desired changes. Instead, one of two alternative methods may be 
employed. 

in the first method, a separate oligonucleotide is generated for each amino acid 
to be substituted The o.igonucleotides are then annexed to the single-stranded 
template DNA simultaneously, and the second strand of DNA that is synthesized from 
the template will encode all of the desired amino acid substitutions. 

The alternative method involves two or more rounds of mutagenesis to produce 
the desired mutant. The first round is as described for the single mutants: wild-type 
DNA is used for the template, an oligonucleotide encoding the first desired amino acid 
substttution(s) is annealed to this template, and the heteroduplex DNA molecule is then 
generated. The second round of mutagenesis utilizes the mutated DNA produced in the 
frrst round of mutagenesis as the template. Thus, this template already contains one or 
more mutations. The oligonucleotide encoding the additional desired amino acid 
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substitution(s) is then annealed to this temm*ta - * «. 

«»*. «„ s , rom lhe « ™ T. , ,9S '"" ns ' 01 DNA 
o« „ „. ^ as . Bmplat Z ^7 ™* - ™. 

P " a thlrd round o» mutagenesis, and so on 

: rrr* rna - ™ pcn — * — » - 

«™. .mourn, „, template DNA „ „ SM as £ > 

1 1 :r; , " ia,iv " y ,a,ae ,uami,ies - ■ — ^ - — . 
zz t„ : t™* ° n,v ai me — • - p*-. ta. ». 

nit , n oi a muB "° n * a dna ' «~ »' *• p«»« i. 

droned ,p o, s , lap , hs ^ 0 , mu , alk)n ^ (o 

«- ft. pfasp,,,,. „ ms savlmca can p , |oeatM ^ 

of DNA Pounded oy me p*,„ can ., ^ ,„„„„.,, pcfl £ 

« - P-«on o, ,h, mu,a,,.„ 5p . 0l , ied „ ^ ^ 
fcum. u topi,., copyl ^ „ somml)iU .^.^ y 

If the -tip o, lemplal , „ proaua h exB 

,p::r n zr .rrr -r- m pcr 

yy. mmauons at separate positions can be introduced 

«l.n o-lffere™ n,u,ant „,™ rs <nd „„,„„„ Mo " 
aim u ,,.„.p ush , „ „. „. „. omen , ,„ , ^ (w ^ 

w ,* h « , , , ou 1,16 action mixture is overlayed 

pTJ » r ,a " - «». -oo-o 
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2 min. 55°C 

30 sec. 72»C. then 19 cycles of the following: 
5 30 sec. 94°C 

30 sec. 55°C, and 

30 sec. 72°C. 

end * " " m °" *" i! "» m »» '"«™. cyder 

insertion into a vector. 

Another method for mf ^, Mmls . 0TO .„, | bagod on the 

..cnnfoue described b, „.„ ., „„ a .„., 34:31S J J »» 

the pit^d ,„ 01her veetor) ^ mpl DNA to be „1™ 

,„ , . .* ^ DNA ,o be ^ „ aenwta) . ^ ^ unique 
res,n=„o„ enaonoelease „„ ,^ ^ m> 

' s " ie "° n s " * «»» ■»» ,*** usl „ g th . , B .J. r a 

********** „,„,.„.„„„ m „ ted , 0 |mroduc 

ocat.ons < h ¥ ^ DNA. Alter th. ,. tWelto , M . s „„„ ^ , 

I . plaen,,d, th. pie.* Is cul „ ,„„ ,„ 

..,o„. encoding th. s w 0l m . 

contain, fh. *-«. **** „ ^ hg ^ 

technws. Th,s double*™,* oli^eo*. „ ^ ..^^ ™ 
cassette Is designed lo have 3' end «... =ej»ene. to. 

,, , „ , " ro ™™ 3 =nd 5 ends thai , re compatible »lm the ends of the 

Prized p,„ mW . such «w „ o-, „ ^ 

noweont.in S |h.muaBdn v , lgm d 0NA ^ |UTO , r™s plesmld 

The nocWc add (..,.. CDNA „ gnomic DNA, Mtlv . „ ^ 

•sand potypeptM. „ !„..„„, lnl0 , furfheHlZ 
,.rnp*.«.„ ., „. ^ „ ,„ ^^tSLT^ 

^Pl»««, of for DNA (2) 0) „ nucMc ^ ™ 

oon^rn v*.u. oomp 0 „.„ B „.„„„,„, 0 „ „ ^ » 
expression DNA, and fc hos, o„, „»h .hlch I, I. cmp.*,.. The 
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signal sequence, an origin of reoiicatinn nM 

element, a promoter and a ,™ °' mark6r genGS ' an 

P omoter, and a transcription termination sequence 

0) Signal Sequence Component 

PCypeptide ^ST^ Pr6,9rab,y 8 Sl9na ' S6qUenCe ° r — 
Polypeptide ,„ aen JT " " ^ N ' ,erm ' nUS °' ,h ° matur8 or 
P ypept.de. In general, the s.gna. sequence may be a component of the vector or i, 

-ay be a par, of the m P i ligand DNA that is inserted into the vector ^VhTT 
s,na, se^ence seiected shou.d be one that is recognized a!d ZL^^TT 
' a S1 gna, peptidase, by the host ce... For prokaryotic host ^ * 

and process the native „,p, ligand signal seque J the siqnal s t 090,26 
a proKaryotic signa. sequence selected, f r e^m I Z 7 " ^ 
Phosphatase, penicillinase. , PP , or heat-st^e e ell Cder S "ST"": 
secretion the native signa, sequence may be substituted bv e l th " 
a. P ha factor, or acid phosphatase ieaders thecal ^ ,nV8rtaSe ' 

362.179 pub.ished 4 A pri , 1990) , , he . ^myiase leader (EP 

published 15 November 1990 ?' * deS ° r ' bed ,n W ° 90/13646 

herpes simp.ex gD signa,. SeCr9,0ry ' 9aderS ' *» ex -P'°. *e 

Or/o/n o/ Replication Component 

— i- ~~.:rr: — 

mmrtuutar, cell. aener.ll, m. „ , " *" cto " ln » VKI °" * 

„,,..,* ^ ' !om,,<,,, • , " ,s '« 



Mom expression vectors are •shuttle 1 vectors /. *. 

— - . ,..s, .„. cs, ., jr — 
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organism for ©xpression For 

«.«.- I. ton**,* int0 ' 7" * *"« * £ »•» th. sa™ 

. .c-^z^irr * ,nMn,m - - « ~~ * % 

9.n.™ and insert, 0^,1^ Tr. * B °'™ > " >i """ —*»«°" « - 

«p, . , 0 excfce ihe w ^ ona 

f«9 Selection Gene Component 

-JEST ;ttr v t -* * ! ** n *• — . 

antibiotics or other tnvin* - „ , ... ( ' °° nfer res, stance to 

» -~ Jz^-jriTr m r ,xaw ' - * 

MUM torn complex meal, .„ T" " C "" C< " m, " l,ms ™' 

! ° One examo,.^ 7! e "° 0< " n( ' *»"— *» «■* 

e. M1 ..ia27 „ S82J1 mycopneno „ c ' «" «*** 

5 (19801) 0, hyoromycin SuS d M „«, c« «*«».«M:14aj 

example* „^ n ab '" <*"' Ste/.. 0:410-413 (19851). Th, 

-* or ^rcrr n,om>c ' n <s ' n ' ,wn, • w 

Examples of other suitable selectable markers for mam ., 

concentration of selection agent in th. m«H. , conditions ,n which the 

to a.piif.cation of both ^JZ^T^^r^^ 
polypeptide. Amplification is the process to IT " 9and 

production of a protein critical Z 1 ^ ^ ,0r the 

prote-n cnt.ca. for growth are reiterated In tandem within the 
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chromosomes of successive aeru»rati rt «* ~< 

W e M , 1 ^;i:: g ::; , ';;: , r, a ■* - dhfr - - 

5 metnotroxata (mm . <""»"»™">* in a cuta. meat™ „,„,,„„ 

.1 MIX. Th.s loads to tn, syntnasl. of m u «lpl. cop,., „, lns DHFR 

— - — ON* co-npn.ln. , h . SXW ^ OT X s s : a h 

z rxo rum rr - Bs - - ■» 

.no,*, „,„, „oa„d, DHFR „,„„,,„ „ d ™ 3" 

, ptophotn^ase (APH) „ „. ^ ^ 

">««=, . Kanamycln. nsomyeln. or a4 „. See ^ ^ ^ ^ « 

r'r ,^ o t- ""'•"-*»»* IN* K,no,n,.n «.,..«,.„., 
7.141 [1979], or Tschemper et al.. Gene, 10:157 Mason Th» 

.»r example. ATCC No. 4407* „ PEP4 ., ,.,„„„ a Z„ x ,'„", 

an 71 " S '°" W ' - »«" 

an effeot„e anv,ronm.nl to, oateoling transforation by growth In ,h. .h. , 

are comp te m.n M py ^ ^ ^ ^ ^ 3M26) 

(iv) Promoter Component 

th. hjT " d *"** , " Ua " > COna,n * *»t I. recoonfced „ 

~ - ^,a, n^ « z.^. r; n 

ctamt. ,nd«a. ,„ d const,,*,. 

™™..d ,.v„ o, W „ S o rtpta lram ONA u „d„ « eonTO , P ,„ 
d^. - cond,,^. ..... ,„. pres . ra „ ^ „ , ^ 
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Z Z 1 a lar9e number of promo,ers recogni2ed * a °< 

eTot olT are we " knom These promoters are operab * ,lnked - -* '*«- 

encod.ng DNA by rennoving the promoter from the source DNA by restriction enzyme 
4^ -erting the isolated promoter sequence into the vector. Both the native 
mpl Hgand promoter sequence and many heterologous promoters may be used to direct 
ampliation and/or expression of the mpl ligand DNA. However, heterologous 
promoters are preferred, as they generally permit greater transcription and higher 
y.elds of expressed mpl ligand as compared to the native mpl ligand promoter. 

Promoters suitable for use with prokaryotic hosts include the B-lactamase and 
lactose promoter systems (Chang et al., Nature, 275:615 f1978J; and Goeddel et al 
Nature, 281:544 [1979]). alkaline phosphatase, a tryptophan (trp) promoted 
system (Goeddel. Nucleic Acids Res., 8:4057 [19B0J and BP 36,776) and hybrid 
promoters such as the tac promoter (deBoer et al., Proc. Natl. Acad. Scl. USA 8021- 
25 [1983]). However, other known bacterial promoters are suitable ' Their 
nucleotide sequences have been published, thereby enabling a skilled worker operably 
to (.gate them to DNA encoding mpl ligand (Slebenlist et al., Cell, 20 269 [1980J) 
us.ng linkers or adaptors to supply any required restriction sites. Promoters for use 
in bacterial systems also will contain a Sh.ne-Dalgarno (S.D.) sequence operabiy 
linked to the DNA encoding mpl ligand polypeptide. 

Promoter sequences are known for eukaryotes. Vlrtualfy all eukaryotic genes 
have an AT-rlch region .ocated approximately 25 to 30 bases upstream from the site 
where transcription is initiated. Another sequence found 70 to 80 bases upstream 
from the start of transcription of many genes is a CXCAAT region where X may be any 
nucleotide. At the 3' end of most eukaryotic genes Is an AATAAA sequence that may be 
the signal for addition of the poly A tail to the 3' end of the coding sequence. A.I of these 
sequences are suitably inserted into eukaryotic expression vectors. 

Examples of suitable promoting sequences for use with yeast hosts include the 
promoters for 3-phosphogfycerate kinase (Hltzeman etal..J. Biol. Chem., 255 2073 
[1980]) or other glycolytic enzymes (Hesa et al., J. Adv. Enzyme Reg 7-149 
[1968]; and Holland. Biochemistry, 17:4900 [1978]). such as enolase 
glyceraldehyde-3-phosphate dehydrogenase, hexoklnase, pyruvate decarboxylase' 
phosphofructoklnase, g.ucose-6-phosphate Isomerase. 3-phosphoglycerate mutase 
pyruvate kinase, triphosphate isomerase. phosphoglucose isomerase and 
glucokinase. 

Other yeast promoters, which are inducible promoters having the additional 
advantage of transcription controlled by growth conditions, are the promoter regions 
for alcohol dehydrogenase 2. isocytochrome C, acid phosphatase, degradative enzymes 
assorted with nitrogen metabolism, metallothionein. glyceraldehyde-3-phosphate 
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dehydrogenase, and enzymes responsible form*,, 

vectors and promoters for use in „ "* 93 ' aCt0Se utili2a,ion - Suitable 

* EP 73.657A Y eas , enha eXPr8SS ' 0n ^ dflSCribed in H ««™n 

promoters. enha " CerS a ' S ° are ^ntageously used witn yMst 

virus, fowipox vlrus (UK 221 ' ^ 7 e ;° meS ° f VifU * « Po-Voma 

Adenovirus 2), bovine papl 1 ' ^ * ^ 1989) - a ^ 0 ^ <"* as 
retrovirus, hepatitis- Z r ^ SarC ° ma ^ ^-ga.ov.rus. a 

Promoter, from heat-shock promoters and romt " " '' mmUn< * ,0bu,i " 

with the m P , , igand 8e ' Df J ' ' r0m the P romo '^ normal* associated 

cel. systems. ' ^ *** Pmm0i ™ « with the hos, 

The early and late promoters of me SV40 vim. ~ ■ 

F-ers er a /., /v. ftfr , f 273 113 r 1S7 .,. M ... ° n9in of replication. 

The immediate ear.y promote 'of l h ** ^ 78:739 «^ 2 [1981,. 

as a „,Wd„, E JLT^i^^^ " *™ — 
> s^tem for expressing DNA in mnj^ S 9 *™ A 
vector is disclosed in U.S Patent No s TlT "* Papi "° ma M 8 

^bed in U.S. Patent No £ * ' ? * - ■* «~n * 

[1982, on expressing cDNA enco L ' " ^ ™*°™B 

- - - con,::: 2 : :~r cdna in -« 

Canaan! and Berg, Proc. Acad . ScL USA ^ / " «T 8 ' mP ' eX ^ 
«ne human interferon si gene in cuitured rnouse an Tl, 1 °" eXPre8Si ° n ° f 

*t **L SC. USA. 7S-.Sm.S7sTZT'T >' ^ ^ * * 
sequences in CV-1 monkey Wdnev k J. * re88 ' 0n °' baCtertal CAT 

~, He u 0... * ir: e r 3 :rr:: , : Mtt ' >— 

terminal repeat as a promoter. * ° US SarCOn,a ^ 

M &j/)ancer Element Component 
Transcription of a DNA encoding the mpl iioand of , 
eukaryotes is often increased by insertino anTn errt '° n by hi9her 

are cfc^ng e,ements ^ ^ ^ 

on a promoter to increase Its trans^n,/ = ° m 10 t0 300 bp " ** ac « 
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transcription unit, within an intron /R a n* ■■ , 

Many anhanca, aa,!! " <° S6 °"» « <*» «„.. 4:, 293 [ 198 4„. 

>»>»*, a.,„op,„,r 2 T ™" "° m m " m,n8tan <**"• «■«". 

. *m . .uCr , : r , Typica " v ' ho,, ' ,v "' « - - ■ - 

*• Polyoma a„„a„ J 11 7 W< "" easlou ™ s °'°™«" enhance,, 

<V0 Transcription Termination Component 

anima, T ^ Ce " S P'ant. 

ammal. human, or nucleated cells from other multicel.u.ar organisms) m also 

rmL™ neCGSSary " - — P-n an. 7/ stl L 

th mRNA. Such sequences are commonly available from the 5' and occaslona y 3 
untranslated regions of eukaryotic or viral DNAs or cDNAs tk 0CC * M)f 3 

M Construction ana Analysis o/ Victors 
ConataUon „, su „a ble v , aore ^ 

:::::: r:Tr ~ is ° a,M - 
™ r:; rr wemM * * — - - - • i-^s 

W Transient Expression Vectors 
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™~TJZ^ZT sys,ems ' comprisins a suitab, ° ~ 

e« * Cone, O^as 1 as " ^ * >~ S 

desired biological or physio.ooil ^ SCreen ' n9 °' POlyPeptides ,or 

S Particular, U L in ^ ^" ^ ^ — ^ « 

of mpl ligand polypeptide I k PUrP ° SeS °' «« ^ 

polypepl.de that have mp, ligand polypeptide biological activity 

Ox) Suitable Exemplary Vertebrate Cell Vectors 

no. 5 , 2 S 8l28 7> or pSVISB (PCT P ut , lteall0 „ No . wo o, Ja,,, ' * 

D. Selection and Transformation of Host Celts 

X1776 /*Tpn m } ' h ° U9h 0th6r StrainS 8uch « B. CO//B £ coif 

ir/ 139,383 published 2 Mav iojki 

«uy MW my e „ hosts (U.S. Patent No. 4 943 529, 8ueh „ 

(Louvencourt a/..,. fl ac ,„ /o/ . 737 m 983 H , T ^ * ' actls 

183.070; Sree k „shna et a ,, j. Basic *cJT Z^^TT 
rifcftortfm. re M /a (EP 244 234) Ntuim 28 - 265 ' 27 8 [1988]), Cnd/da, 
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Commun., 112:284-289 [19831- Tilburn a* ,/ o 

Hynos. BMBO J.. 4:475-479 [1985]) (K " I) ' and 

ILl Z JTTr hi9 karyo,io - cul, °" ls *• 
> "ZT /7 s insea hos ' cel,s ,rom hM,! — - 

Plan, oel, outaM of Conor, «,„,, po , afe 

rr T"" 18 " ,o ,he ~ dna - tai— z 

DHFR <CHO, Uta* .no Cha*. ftoc. MM ^ jb- ^ „.„,. * 
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kidney cells (CVt ATCC cri 7n\. a* * 

CRM567); L a „ ^ ^ "** (VER °' 76 ' ATCC 

(»CK, ATCC CCL ~ ** ' HEU ' ATC ° C ° L * — - 

VL M '< t5u ' ,al ° rat liver cells (BRL 3A ATCC em iaah u 
cells (W138, ATCC CCL ?«• h . ,: human ,un 9 

' ^ ^ 7S ). human liver cells (Heo G2 HR fin««. ~ 
5 tumor (MMT 060562 ATCC en to, ^ m ° USe mammar y 

383-44 «n r 1Q »o, ); TR ' C8l,S (Ma,her et N.Y. Acad Sci 
383.44-68 [1982]); M RC 5 cells: FS4 ceils: and a patoma line 

Host cells are transfected and preferably transformed with the above-described 

™ ::r n9 vectors * th ,ion and * — 

1 0 lZZ£t aPPr0Pfia,e indUC ' n9 PfOm0,erS - «™<~. or 

amplifying the genes encoding the desired sequences. 

Transtech refers to the taking up of an expression vector by a host cel. 

. . rr rr.nr.r~ - - — - 

- r; - ~r^:r^-:~ 

^"tir r r- o, "* s3:3 ' 5 1,9831 - m w ° 8 ^ 

»» «.».»,. crfaurn phMpha „ pr9cipitatk)n ^ ^ „„ „„ 

rsrrT " 97si i! p,,,wrM ' 6,n "" >™* * ~" - £ 

./..en, u a n. f om» MTO hare p ., n ,,„„,„„, ^ <* 
««d tt A ugus , ,« T ra n.f.,™„ on , ,„,o „„, ar . „ ' • 38 "' S 

Won «y ala, bo US e«. ' I**"™**- « 

£ Cufturing the Host Cells 
5 Prote-yotte e* M to pre<iue . „» , ^ „,„„,„»,„ „ „, 

... « „ ^ meto as ^ ^ h ^ M «2„ 
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(Sigma), Minimal Essential Medium mwFMi e - 

Du.becco, Modilied ZWsHeZ ^m^ 9 ! 1 RPMM64 ° ^ "* 

host Ce .,s. ,„ addition, any of the 1^ " ™ ^ ** CUttU "' n9 ' h ° 

58:44 [1979], Barnes and S >1 « * ^ "* ^ ^ 

> 4,767,704- 4 657 866 .1 ^ 1021255 U S ' ^ N «- 

87/00 19 5- US p aent ' ' 927,762: " 4 ' 560 ' 655: W ° 90 '°343 0; WO 

07/592 14 botn n 7 °' 9fl5: " C ° Pendin9 U - S " S - N - 07/5S *.107 or 

07/592.141, both f,| ed on 3 October 1990. the disclosures of all of which ar* 

~ d : erein ^ re,9r8nCe ' ^ " ^ " "~ * * * < le Any 

c o s I ? ^ 59 SUP ~ 38 — - *°™es and/or other gol 

odl h I" ' ° f 6Piderma ' — as 

um chionde. ca.cum. magnesium, and phosphate,, buffers (such as HEPES) 

nu*~*. (such as adenosine and thymidine), antibiotics (such as G en,a m yc.n~ 
drug), trace eiements (defined as inorganic compounds usua.ly present Tina. 

r ::r s in the m,cromo,ar -™ " d *— - - — ^ r 

2 : ~:r T pl : ments may a,so be inc,uded * ™ ~ 

hat would be known to those skilled in the art. The culture conditions such as 
temperature, pH, and the ,i ke , are those previously used with the host cel. seZe* Z 
express.cn, and wi,. be apparent to the ordinarily skilled artisan 

The host cells referred to in this disclosure encompass cel.s in ,n vttro culture 
as well as cells that are within a host animal. 

F. Detecting Gene Amplification/Expression 

Gene amplication and/or expression may be measured ,„ a sampte directly 
or xam p ,e, by conventions Southern b.ott.ng, northern biotting to <££2 

seer;: 0 Lr\: homas ' proc - Nau - ■* 

[1980]). dot b.ott.ng (DNA analysis), or in situ hybridization uai „„ 
~y *be,ed Probe, based on »e seances pro^^aZ 1*" 
may e employed, most commoniy radioisotopes, par.icu.arly 32 P . Howev " * * 
ec n, q ues may also be empioyed. such as using biotin-mod.ed nlZe 
introduction into a polynucleotide. The biotin then serves as ,he si,e fo^L to 
av,d,n or antibod.es, which may be labe.ed with a wide varle* te 1 h 

employed that can recognize specific duplexes. Including DNA duolexes una h 1 

tzz jet- - d ~ ^ 



«* . W^.^, stainln9 0 , ^ ^ ^ 
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body fluids, to quantitate directlv th« a 

a p,Mu °'- w,,h 

delation .„„ , iMlion , " * fe '"P"*. typlcrtly by 

g« *«m coup,., 'Jrn, UT" * b6 " M " ■» 



further below 

G. Purification of mpl ligand Polypeptide 



Mpl ligand preferably is recovered from the cuttum 
1 5 polypeptide, although i, also may be recovered from hJ T " * ""^ 
expressed without a secretory signal ^ *«* 

~: ;~ 7 o,her ,han ° ne - *~ 

However, i, is sti.l usuaily necessarv to I " °' hUman °^ in - 

20 ce.. proteins or po.ypepZs To 1 ^ ^ ^ 

aeneous as ^ZZZrTl~Z *" 

'3 centrifuged to remove parLate e T debris Z **" °' *— 

actions are then separated A, fB , ^ Protein 

concentration «„er ( T Wo 2 " ^ 

» - The mPl ,1^:: : r ,tra,ion - - 

the membrane fraction of ,he cuS^e Zl™ ^ "* ^ 

-mbra.e bound. Mp( ligand thereat pu Z ^ * 

and polypeptides bv saltinn OIlt a ^ / contammant soluble proteins 

0 and others common to protein purification J T ^ ^ Ce " U, ° M 
suitable for protein pulcIL 7 , ! " ** procedures 

Mab), .ceptlaZJt"^^ ^ 
interaction chromatography (NIC) f. „ ,h ! SePhar0M) ' "^'Phobic 
chromatography < J Con A 2 h " * Ph "* T ° y0pear, >' 

» <•*. SepnLx ^ ^ *~**>»*»~>. *. exclusion 
carboxymethy, and ^Z:^^^ 9 ' T " 
-ipuid chromatography (RP. HPL C) (see uSZT^T^ ^ 
[1984] where two sequential RP-HPLC ,t n „! ^marog., 296:171 

q HPLC steps are used to purify recombinant human 
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IL-2). Other purification steps optionally include- .th-n i • 
preparation of a mot iinann * • venation. For example, 

tain, ' ,1 1 „ ° " > " '° " "» 

modification to account for chanaes in JT * ^ may requ,re 

egression in recom^ leT™ ^ °' °» *" « * ««" upon 
W. Covatenr Modifications of mpl ilgand Polypeptide 

>-*. up „ Jl^L ' * ' "* , ' , '' ,m ' """ «•« 

fragments thereof are introduced into »h„ , . P Qand or 

residues of the mp, ^ by te ' 9eted acid 

^ : £::= rrrr; rtrxr- 
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5 .5. ? .o?zr:r;:: :;:,r; * ~" - <-*—. .. * 

«,*», cacody.a.e at p H 6 .„. ' " Pre '"" l> " * 0.1M 

™w«ng , he ch8r9 , 0 , |ys|ny| resWu>s ^ ol 

Slypxvl.,, P*"""""'"... and ,r a „ saml „ a „. eataly2 , a ^ 



ftrAllr , c -u 9 pK a 0< fluanidine functional 

group. Furthermore, these reagents may react with the groups of , ysin e as we„ as the 
arginine epsilon-amino group as the 

«h,..3.(4. a z. r i a . 4 ,« lnlell , ylp . nw)ca Furtharmora !. 

mp/ llgand antibodies, and Wc* „ Common! 1 PUf1fy,n9 anl '- 

, , feI ,^ Commonly used crosslinking agents include a o 

1.1-bis(dia 2 oacet y l)-2. p hen y iethane, glutaraldehyde N L 
3 5 esters. ,or examp.e. esters with 4-azidosa.icy.i o^T^T^ ^ 
•ncuding dteuccinimidy. esters such as Ttml 

—a, ma.e.m.des such as ^^^^^^ 
such as -hy I -3- ((p -a 2 ido P hen yl) d i th,o ]P ro Pi o i m i da t e yS^^H 
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" ,h " »' c, MSII „ to in „,. prejenc . 0) „ 0 „, 

3 e r»T4 ,ea0 " V ' ! " 6S, ' a * S *"*•" " U a p »« 

. zr::: 4 ^ - — ^ * 

fans within the scope of this Invention. 

10 „ho h °* er m ° di " Cati0nS inc,ude ^^oxylation of proline and lysine 
Phosphoryiation of hydroxy, groups of sery. or threony, residues, me.hy.at.on of ,he 
a-am.no groups of lysine, arglnine. and hist.dine side chains (T.E. Creighton 
Prote,ns: structure an* MCecu.ar Property W.H. Freeman * Co.. San Frandsco' 

15 term T °' *" d of any C 

1 5 terminal carboxyl group. 

within ,hT 6r ^ K f C ° Va,ent m ° di,iCa,,0n °' *• WW* M«« 

o , e 1 T- inVent ' 0n C ° mPrteeS a,teri " 9 ^ nat,VS Pa«em 

of the po.ypept.de. By a.tering is meant de.eting one or more carbohydrate moieties 

found ,n naave mp, ilgand. and/or adding one or more g,ycos*a,ion sites that are I 
2 0 present in the native mpl ligand. 

Glycosylate of po.ypept.des Is typicaHy either N-linked or O-.inked N- 

s IT 8 'IT a,taChmem °' Carb ° hydrate 10 *° «* - an 

asparag-ne residue. The tr,pep«.de sequences asparag.ne-X-ser.ne and asparagine-X- 

threon,ne. where X ,s any amino acid except pro.,ne. are the recognNon sequences for 

25 enzyme attachment of the carbohydrate moiety ,o the asparagine side chain. Tn 

the presence of either of these tr.pep.ide sequences In a polypeptide creates a po.en.lal 

SZTT . °* ,inked 9,ycosy,a,ion re,er8 ,o ,he atta *™< - - « £Z 

se" 7 t ' 9a,aCt08e ' ° r *~ to 8 hydroxyamino -* — — 

30 i r n 7' aW,0U9h 5 - hydr0Xypre »- * 5-nydroxyrys.ne may also be used 

uxJZ? Z! 00 ^ 0 " SrtM 10 ^ mf " " 9and * ~-en.en.Py 

.he ' 9 *• &m{n ° 8eqUenCe 8UCh "* « ^.ns one or more of 

Itn Tk tr,PeP,We 8eqUe " CeS (, ° r N -" nked Q,yC0SVla «'°" «->• ^ 
Mton may also be made by ft. add.on of. or subst^n by. one or mor serine 

s °2T T 10 ,he na,ive mpl " 9and sequence *»■•«" o*"*— 

>5 s«es>. For ease. fte mp, Ugand amino acid sequence .s preferabiy altered through 
changes a. fte ONA .eve.. par.lcu.arty by muting .he DNA enood^g fte mp , ,1 1 
Por.pept.de a. prese.ec.ed bases such .ha. codons are generated fta.l j££ 
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the desired amino acids. The DNA mutation(s) may be made using methods described 
above under the heading of "Amino Add Sequence Variants of Ligand ■ 

Another means of increasing the number of carbohydrate moieties on the mp, 
l-gand , by chemical or enzymatic coupling of glycosides to the polypeptide.' These 
5 proce ures are advantageous in that they do no. require production of the polypeptide 
« a host cell that has glycosylate capabilities for N- or Clinked glycosylate 
Depending on the coupling mode used, the sugar(s) may be attached to (a, arginine and 
h.st,d,ne. (b) free carboxyl groups, (c) free su.fhydry. groups such as those of 
cysteme. (d) free hydroxyl groups such as those of serine, threonine, or 
1 0 hydroxyproline, (e) aromatic residues such as those of phenylalanine, tyrosine or 
tryptophan, or (f) the amide group of glutamine. These methods are described in WO 
87/05330 published 11 September 1987, and in Ap.in and Wriston. CRC CrH. Rev 
Blochem., pp. 259-306 [1981]. 

Removal of carbohydrate moieties present on the mpl ligand polypeptide may be 
accomplished chemically or enzymatica.ly. Chemical deglycosy.atlon requires 
exposure of the polypeptide to the compound trifluoromethanesutfonic acid or an 
equ,valent compound. This treatment results in the cleavage of most or all sugars 
except the linking sugar (N-acetylgiucosamine or N-acetylga.actosamine). while 
leaving the polypeptide Intact. Chemical deghycosylation Is described by Hakimuddin et 
2 0 a/.. Arch. Blochem. Blophys.. 259:52 [1987] and by Edge et a,., Ana,. Blochem 
118:131 [1981]. Enzymatic cleavage of carbohydrate moieties on polypeptides can' 
be achieved by the use of a variety of endo- and exo-gfycosidases as described by 
Thotakura et a/.. Meth. Enzymol., 138:350 [1987]. 

Glycosylate at potential glycosylation sites may be prevented by the use of the 
25 compound tunicamycin as described by Duskin et a/.. J. Biol. Cnem., 257:3105 
[1982]. Tunicamycin blocks the formation of protein-N-glycoside linkages 

Another type of covalent modification of mpl ligand comprises linking the mpl 
-.gand polypeptide to one of a variety of nonprotetoaceous poiymers. e.g.. po^thyiene 
g*ool. polypropylene glycol, or po.yoxya.ky.enes. in the manner set forth in US 
30 Patent Nos. 4.640,835; 4.496.689; 4,301.144; 4.670.417; 4.791.192 or 
4.179,337. Mpl ligand polypeptides covalently linked to the forgoing polymers are 
refered to herein as pegylated mpl ligand polypeptides 

« will be appreciated that some screening of the recovered mp, ligand variant 
will be needed to select the optima, variant for binding to a mp, and having the 
immunotoglcal and/or biological activity defined above. One can screen for stabifity in 
recombinant cel. culture or in plasma (.*.. against pro.eoiytic cieavage), high 
affmlty to a mp, member, oxidative stability, ability to be secreted in elevated yields 
and the like. For example, a change in the immunological character of the mp, ligand 
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polypeptide, such as affinity for a aivan ontik ^ . 

immunoassay. Other poten ia. mod T " m6aSUred * 

as redox or therli . ™ Mc!iU ™ °' P*Wn or polypeptide properties such 

5 

^ Polyclonal antibodies 
1 0 raised • PO ' yd0M, an,ib ° d ' eS * ^ P ° ,VP9PMeS 0r *«*»•"«■ «• S^raily 

rrrrr mump,e subcutaneous (sc) ° r »> - 

he mp, | ig and and an adjuvant. ., may be usefu. to conjugate the mp , llgand or a 
ragmen, containing the targe, amino acid sequence to a protein that immunogenic in 
he spe C to be immunized, e. 9 , key ho,e iimpet hemocyanin. serum aibumin, bovine 

»«H,no ge n.c con,uo.le, o, IMw. by combining , mg „, , „ „. ° ' 

TIT T T" *" S °"""' n """""'"^ " — • °" -* e 

n F,. u „ * con , plete Mlllvanl „ ^ >t j W • 

« m . ^ are M an, „ raum „ ^ ,„ ^ 

i' »rr t uni " ,Be a,er 

.T" . -~ *"* 6UI 00, * , ' ,,M " ' — p— 

enhance the immune response. 
3 0 ffil Monodonai antibodies 

Monoclonal anHbod.es are obtained from a population of substantially 
homogeneous ant.bod.es, the individual antibodies comprfcing the population! 
" M ^ » P088,bte "«* occurring mutations tha, may e presTn, " 

35 a tLT° UntS ' ^ • m0n0C,0na, ■ the character o , «, 

3 5 antibody as not being a mixture of discrete antibodies. 

m*. 7 T P ' 8, ,i9and m0n0Cl ° nal 8ntib0die8 °' *• may be 

made using the hybridoma method first described by Kohler & M.istein. 
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ppT 9 : °r O (ao '"" 9 ' M °"° eto,a ' ~ 

pp.aa-103 [Academic Press, 1986]). 
1 0 The hybridoma cells thus prepared are seed** „ 

medium that D r«f.»h. . • d 9rown in a sultable cult ^e 

:r :.:r ~ ~ ^xts: 

Preferred myeloma cells are those that f.,e* ««• - 
I-, expression o, by ^ *" "* ""*"'" — 

.om.,s av,.,L ",.! ,'„ tl T, " H ° PC ' 2 ' - ««•. 

USA L. . "! C "' D ' S ""'"" 0n S '" D »=°. 

i «„, , ^j^,ToJr eTi - h — ~— 

The binding affinity of the monoclonaJ antibody can for ««m nIa k w 
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» P V. gel electrop„„ r9s , s , aialy!l! . „ ^ civom.t.g,aph y . 
„o,„ 7 9 m ™"° m ' W »» ™«» » '•«% isolated and 

ce l. ., my**,. cell! thal „ 0 not .me™,,,. ^ imm ^ tuM ? ,„L J 
l.« toy** „.„«,.„., antibodies * „ ^ ..J™ 

se.uenc. to, a nOT-lmmmjoloQulin polypeptide. ' 

For diagnostic applications, the antibodies of the invention typically will be 

of producing, either dlrectfy or indirectiy. a detectable signal. For examoleT 
detectable molety may be a radioisotope, such as 3 H . u C 32 P s s 7 

IZT ~ «* as 'fluoresce; 

modamlne. or iucfferin; radioactive .ectopic labeis. such as, e.g.. 125, 3 2p ^ „ 

H or an en^e. such as a, ka ,ine phosphatase, beta^ctolse or hceradJ 
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Any method known in the art - 
H--rt.hu. • „ separately conjugating the antibody to the 

t ^ emP, ° yed ' inC ' Udin9 ,h0S8 me,h0ds by Hunter. 

a,., A/a^re. 144:945 [1962]; David. e , S/ocWs fry. 13:1014 [1974]- Pain 

5 30:407 [1982]. y9ren ' W,stoc/wm - Cytochem.. 

m e ,h 0 / he r ib0dieS °' Pr8S8nt ' nVenti0n may be emp, °y ed in a "V known assay 
method, such as competitive binding assays, direct and indirect sandwich assays and 

-mmunoprecipitation assays. Zoia. MonoCona, Annies: A Menus, of Tecnn aues 
PP.14M58 (CRC Press, Inc.. 1987). 
1 0 Competitive binding assays rely on the ability of a labeled standard (which may 

be a mpl | Igan d or an immunologically reactive portion thereof) to compete with the 
test sample analyte (mp, ligand) for binding with a limited amount of antibody The 
amount o, mp, iigand in the tes, sample is inversely proportional to the amount of 
standard that becomes bound to the antibodies. To facilitate determining the amount of 
standard that becomes bound, the antibodies generally are insolubilized before or after 
•he competition, so that the standard and analyte that are bound to the antibodies may 
conveniently be separated from the standard and analyte which remain unbound 

Sandwich assays involve the use of two antibodies, each capable of binding to a 
different immunogenic portion, or epitope, of the protein ( mpl ,igand) to be detected 
•n a sandwich assay, the test samp.e analyte is bound by a first- antibody which is 
-mmoNized on a solid support, and thereafter a second antibody binds to the anaiyle 

*™Z'V n inS0,Ub ' 9 ,hfee ^ C ° mP,6X - DaV ' d & Q "™' US - P *en, No.' 
4,376.110. The second antibody may itself be labeled with a detectable moiety (direct 

sandwich assays, or may be measured using an anti-immunogiobulin antibody that is 

fcbeled w,«h a de.ectab.e moiety (indirect sandwich assay). For example, one type of 

sandwfch assay is an ELISA assay. h which case the deteCabie moiety is an e^yme 

(e.g., horseradish peroxidase). 

(Hi) Humanized and human antibodies 

Methods for humanizing non-human antibodies are we., known in the art 
Generally, a humanized antibody has one or more amino add residues introduced into it 
from a source which Is non-human. These non-human amino ac* residues are often 
referred to as 'Import- residues, which are typically token from an -import- variable 
domain. Humanization can be essentially performed following the method of Winter 
and co-workers (Jones et a/.. Nature, 321 :S22-525 (1986); R.echmann eta, 
^ 332:323-327 [1988]; Verhoeyen et a,, Science, 23,:1 534-1 536 
[1988],. by substituting rodent CDRs or CDR sequences for the corresponding 

ZT VT™ an,ibody - Accord ^' 8uch " humanized " ■*»*• - 

antibodies (Cab.lly et a/., supra), wherein substantia.* .ess than an Intact human 
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vanabie doma-n has been substituted by the corresponding sequence from a non-human 
spaces. In practice, humanized antibodies are typically human antibodies in which 
some CDR res-dues and possibly some FR residues are substituted by residues from 
analogous sites in rodent antibodies. 

The choice of human variable domains, both light and heavy, to be used In 
mak.ng the humanized antibodies is very important In order to reduce antigenicity 
Accordmg to the so called "best-fit" method, the sequence of the variable domain of a 
rodent antibody is screened against the entire library of known human variable domain 
sequences. The human sequence which Is closest to that of the rodent is then accepted as 
the human framework (FR) for the humanized antibody (Sims et al., J. Immunol 
151:2296 [1993]; Chothia and Lesk, J. Mol. Biol., 196:901 (1987]). Another 
method uses a particular framework derived from the concensus sequence of all human 
anybodies of a particular subgroup of light or heavy chains. The same framework may 
be used for several different humanized antibodies (Carter etal., Proc. Natl Acad Sol. 
USA, 89:4285 [1992]; Presta etal., J. Immnol., 151 :2623 [1993]). 

It is further important that antibodies be humanized wfth retention of high 
affmrty for the antigen and other favorable biological properties. To achieve this goal 
according to a preferred method, humanized antibodies are prepared by a process of' 
analysis of the parental sequences and various conceptual humanized products using 
three dimensional models of the parental and humanized sequences. Three dimensional 
.mmunoglobulin models are commonly available and are familiar to those skilled in the 
art. Computer programs are available which illustrate and display probable three- 
dimensional conformational structures of seiected candidate immunog.obuiin 
sequences. Inspection of these displays permits analysis of the likely role of the 
residues in the functioning of the candidate immunoglobulin sequence, ie., the anafysis 
of residues that influence the ability of the candidate immunoglobulin to bind its 
antigen. In this way. FR residues can be selected and combined from the consensus and 
Import sequence so that the desired antibody characteristic, such as increased affinHy 
for the target antigen(s). is achieved. In general, the CDR residues are directly and 
most substantially Involved in influencing antigen binding. For further details see U S 
application Serial No. 07/934.373 filed 21 August 1992. which is a continuation* 
m-part of application Serial No. 07/715.272 filed 14 June 1991. 

Alternatively, it is now possible to produce transgenic animals (e.g mice) 
that are capable, upon immunization, of producing a full repertoire of human 
antibodies In the absence of endogenous Immunoglobulin production. For example it 
has been described that the homozygous deletion of the antibody heavy chain Joining 
reg,on <J H ) gene in chimeric and germ-line mutant mice results in complete 
inhibition of endogenous antibody production. Transfer of the human germ-line 
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immunoglobulin gene array in such n*™ r 

^°°«*™ M JJS m ZZ ™ an « m ' ee wl " r,su " in 

rr rr p,mucm ,n *~ — ™ 

M«. B»l. 227. 381 (,891|; M.rts .,„ j. „„,. eiol . K2 5S , 

f/v; Bispecific antibodies 

antibod^Tn an, ' b0dieS m0^0C,Ona, ' Pre ' erably human - "un-ntod, 

1 0 Tla ino b T bhdin9 SPeCi " C,UeS ^ ^ »° ^ "*»«■ — * 

1 0 for making Specific antibodies are known in the art. 

Traditionally, the recombinant production of bispecific antibodies Is based on 
he coexpression of two immunoglobulin heavy chain-light chain pairs, where the two 

LLin"^ di<ferent SPeC ' ,iCitieS (M "' Stein CUe " 0 ' ^'-.305:537- 
539 [1983]). Because of the random assortment of immunoglobulin heavy and light 

chains these hybridomas (quadromas) produce a potential mixture of 10 different 

ant-body mCecu.es. of which only one has the correct bispecific structure The 

^cation of the correct molecuie. which is usually done by affinity chromatography 

" 7 athe ; r r berS ° me - and PTOdU « - - S.mi.ar procedures are 
d.sciosed ,„ PCT pubiication No. WO 93/08829 (published 13 May 1 993 ). and In 
Traunecker ef al., EMBO. 10:3655-3659 [1991]. 

domaJTtK 9 h° 8 di,ferent ^ Pr6ferred approach ' variable 
ZT Wndin9 SPeCmCiMeS < an,l ^<9" combining sites) are 

fused to .mmunogiobuiin constant domain sequences. The fusion preferably ,s with an 

rrrr heavy chain com ^ —** - - - 

•he fuslo s . DNAs encoding the immunog.obu.ln heavy chain fusions and. . desired £ 

~ d": T ^ ^ ,nSSrted ,nt ° — " vectors. ^ 

co ransfected .nto a suitable host organism. This provides for great flexIbHrty , n 
adjusting the mutua, proportions of the three po.ypep.Ke fragments h emblH 

" *° *~ Chah8 — * ^nstrucaoTpZ 

f. optonum y ,e Id s. „ however, poss. b ,e to User, the coding sequences for iTo 
M three polypeptide chains h one expression vector when the express.cn o, a. 1 
PC.ypept.de chains , equa. ratios results h high yiek* or Z the ra«oe J " 

abod es are composed of a hybrid tamuno*** heavy chain w.,h a first Ling 
pec.fic.ty ,n one arm. and a hybrid immunog,obu..n heavy cha.n...gh, cha.n pa. 
(providing a second binding specificity) in the other arm. „ was found that «* 
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z m rjTr " oi " a,es *° s,Mr,,,on °' - *— — * 

M Heteroconjugate antibodies 

3 Hat™ He,er ° COniU9ate an,ib0dies are als ° <■»* «"« scope of the present invention. 
He eroconjugate antibod.es are composed of two cova.ent.y joined antibodies. Such 
an bo , es h lor examp(e> befln proposed to ^ jmmune system ce(|s to 

cells (U.S. Patent No. 4,676.980). and for treatment of HIV infection (PCT 

:rr nos j° 91/00360 and wo 92/oo373: ep ° 3 ° 88 >- 

^ m8de USi " 9 COnVenient ere -*'*» cross- 

rjT t known in ,he art - and are disciosed ,n u - s - p — 

4,676,980. along w.th a number of cross-linking techniques. 

.V. Tnerapeutic Use of the Mega.aryocytopo.et.c Protein mp , Llgand 

referr J, \ 'unction and 

referred to here as a megakaryocyte or thrombocytopoietic protein (TPO) may 
be used ,n a sterUe pharmaceutioa. preparation or formulation to st.mu.ate 
-gakaryocytopoietic or thrombopo.et.c activity ,„ patient8 suffer(ng * 
rom ocytopenia due to impaired production, sequestration, or increased destruction 
of platelets. Thrombocytopenia.assoc.ated bone marrow hypoplasia (eg aplastic 
«-* .Cowing chemotherapy or bone marrow tra.p.nt, ma'y be effeltrfe, ^ 

.nil 7*°"* * *" ,nVen, ' 0n M W8 " M « u * « «~d 

~l°:TT n (D,C> ' ' mmUne thr0mbOCyt ° Penia ^-Induced 
TP and non H.V..nduced .TP,, chronic Idiopathic thrombocytopento, congenita, 
thrombocytopenia, mye.odvsp.asia. and thrombose thrombocytopen*. AddK.ona.ry 

Z^TT opoieac protein8 may be ,n treat,n ° ^'0P~ 

no^rc as wen m "° mbocyto8te ,rom ,nflam — — - 

(TPO^rrr I 8 °' *" * thrombocytopo.et.c prote.n 

sTL - mye, ° ,0XiC Chemottw ™Py *>' -tment of ,eukem.a or 

o..d tumors. mye,oab.at.ve chemotherapy for autobus or aHogenete bone marrow 

1" .mm T'T' 88 ' 8, Wi0Path,C *~ tnrombocytopenl 

and immune thrombocytopenia. ^ ' 
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Still other disorders usefully treatPrt uith *k« 
* invent ,„ cllJ d. a , lecls or J^' o ", 7 "'^'^'^ - 
activation on « sur)aces Tl° ' e5, " , '" 9 *" "™ 9 '' »*— • " 

references cited Sr0 " M *""*• "HO « 

alone .^t 7 9a h karVOCy,wi « fc »«*™ •» ™.»» mv.„,l„„ may B . employ 
"on. o, in con*™,™ rtm 01ner cytokJn „. h8m ., opolMns ^ 

M ra o, e«,bod,.s „ ,„e , realme „, „ . Wia „, B1M ^ ma ^ 

m ! C0n " ,0, " ,a8 ma> "* 10 eom,,in " to " «* <"~ P«* « 

WW. nav,ng tnrombopoletlc aolivity Including; G-CSF, GM-CSF LIF M-CSF IL , 
IL-3. erythropoietin (EPO), kit ligand, IL-6. ana il.ii. ' ' 

The megalcyocytopoietlc proteins „, ,„. | nslM „ lwmfco 
««« - . »a,n_, ^ c>rt „ Thfe * 
be entered , Intravenously ,r th^gh , n , nos . „ ,^ ^ 

be u«« paremerally or subcaneously a «l, M . When ad^nl,*, 

systematically, the ,he„pe„,ie ^ Mim ^ „, 

— » ,„, p„. ^ Ji; n ; 

rir r, ar : ^ ,o - *• -■ b ^ *— «— » - 
i,',rr ,nven "° n - """^ • — — « * 

acceptable camera. .„*,.„,,, or suitors. Sucn J 9 ' 

acd, low n^cia, we,,,, (tes , „»„ abM „„ *« 

o^nine. p ,„ t .,„ s , S uc _ ^ 

hy*oph,,,e po,^ ^ a , o.,^^^ ^ j «■ 

-h a, *; 6DTA: - " ™""^ " -* — ^ 

4 ^ o s '» ™ « a oornpouna or mtor. o, ft. 
P"*. a. ^ ... . 0W „ „„. „,,., phmn . .oc^ 
comp.cn « „, m . p„ ys , ologlca „ y areeplal , |e * .^1^ 

™»e. ataoi,,., „ av0 , we ., M ^ ,„ by 

ZT; * 9 " < " ,nl ln m - »««*•«. - J. 

dosage In the range Indicated Is obtained. 
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Sterile compositions for injection can be formulated according to conventional 
pharmaceutical practice For e* am m« w . , ^nverwonai 
„ . . ° r example - dissolution or suspension of the active 

compound in a vehicle such as water or na t,,, a » 

as water or naturally occurring vegetable oil like sesame 
peanut, or cottonseed oil or a svnth a *„ , «. sesame, 
be dasir** b « 3tty V9hiC,e like e,h y' 0,eate or th. like may 

be desired. Buffers, preservatives, antioxidants and the like can be Incorporated 
accordmg to accepted pharmaceutical practice. incorporated 

ma.rice S S ? b ^H e r Pl9S °' ™ X *™^™* P-P-tton. inc.ude semipermeable 

are in th ^ ^ers containing the polypeptide, which matrices 

a m , e orm of shaped ^ g g ^ ^ ^ 

su .a,ne -re ease matrices inc.ude polyesters, hydroge.s poiy< 2 -hydrox y ethy,- 
m ha « a e) as descrjbed by Unger gtai j ^ Ma(ef fles i5:i67 2 ; 

[1981] and Langer. Cem. fee/,., 1 2:98 . 105 [1982] or P o.y(vinyla.coho.)] 

n oa (U '! , Patent 3 ' 773 ' 919 ' EP 5M81) ' C0P0 ^ S - ac 
and gamma ethyl-L-glutamate (Sidman « a ,. Blowers, 22:547-556 [1983], 

yco,r: ct T ne " v,ny ' ace,ate (Lan9er e,ai - supra) ' ««* -* 

ir r r s such as ,he Lupron Depot ™ ****** 

TH h 3 d " 9lVC0,iC aCid C ° POlymer " teUpr0,,de •"->. "d o,y.D- 

(-)-3-hydroxybutyric acid (EP 133,988). 

enable ^ "V" " ™*« «* acid-glycol.c acid 

nab le .lease of molecules for over 100 days, certain hydrogels release proteins for 
shorter t,me penods. When encapsu.ated proteins remain in the body for a long time 
*ey may denature or aggregate as a result of exposure to moisture at 37*. resuming 
- a loss of b,ological activity and possible changes In immunogenic^. Rational 

25 ~ ca ; 0 r be dev, ; ed : or pro,e,n s,abi,,zawon - 

-nvolved. For example, if the aggregation mechanism is discovered to be 

13 dT ''ir ,0rma,i ° n ""*"»•• -V be 

con Z m ^ 9 SU ' fhydryl r9SidUeS ' ,y0phi " Z,n « '™ «*«• "lutions. 
o trolhng mo,sture content, using appropriate additives, and deveioping spec* 
polymer matrix compositions. *p*«tic 

Susta.ned-re.ease megakaryocytopoietic protein compositions also include 
l.posoma..y entrapped megakaryocytopoietlc protein. Liposomes containing 
megakaryocyte protein are prepared by methods known per sr. DE 3.218 21 
Epstein et al Proc. N atl . Acs*. ScL USA, 82:3688-3692 [1985]; Hwang , 1, £ 

4^^^~ ™ u * pa,em Nos - 

,3*»o, ana tP 102,324. Ordinarily the liposomes are of the 
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greater than about 30 mo! % rh*u ♦ , 

.„ h , i:T, !ex ' aiM ' ,,me an " ,oum - — — « 

™n 8 . ta, 0^ « ir T , "' 0,S ■ TyP ' Ca " y ' aai " "** - 
50 MS*, body Tj, u 7 °" "'" "«» « 

EXAMPLES 

Without further description, i, is Sieved th a , one of ordinary skiii in the art 
can. usmg the preceding description and (Mutative exam D .« T 
Present invention to the fuiiest extent The foil ■ ™ k ° «"« 

■~ Po-n, ou, preferred ^ ^^^T^ ^ 
* construed as Siting ln any way of the remajnder - « « * 

EXAMPLE 1 
Pu rl f, ca t,on of the PorclnB mp , 

*« * P«* . day. . J JUT • ™ 8akaW " ' 0mu " fc » ~* - 

N.C. Th. CO*,™ „ wash* wm J '° m " ""^ «».*, 4M 

v. P „ 1M1 psak Z>T. T ,,utea "* 

«. cam, is washed with 5 co,™ ^ J^L ^T""''- 

oontalnlns 2M „,... Prolelra „„ ™ " ™ ,0mM »W04 (pH 7.4) 

containing 2M „„ a „ d , M N _ « ""^4 (PH 7.4) 

ins eluted protein peak 1, mM , 0 „, % 
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glucoside(n-octyl P-D-glucopyranoside^ anH 1 u 

(Rnannnar ka k i , W™ 0 ***) and 1 mM each with EDTA and Pefabloc 
(Boehmger Mannheim and loaded directiv nntn , , . ^ 

, , A , 4 airecuy onto tandemly linked CD4-lgG (Capon, D J 

,Tr^ 7:525 ' 531 [1989]) ^ mP "° G UM (^erce) co,uls (see 

s ZT, 1 1 T 9G (2 m,) column 18 removed after the samp,e " - - 

mpMgG (4 ml) column is washed with 10 column vo.umes each of PBS end PBS 

loTZ 9 f V 3CI ^ e,U,6d ^ 2 - 25 - F -'ons ar 

collected into 1/I0th volume 1M Tris-HCI (pH 8.0). 

Analysis of eluted fractions from the mp/-affinity column by SDS-PAGE (4- 
20/., Novex gel) run under reducing conditions, revealed the presence of several 
proteins (Fig. 5) . Protejns tnat sj|ver stajn ^ ^ ^ ^ ^ 

wth apparent Mr of 66,000, 55,000. 30,000, 28,000 and 14,000. To determine 
which of these proteins stimulate proliferation of Ba/FS-mp/ cel. cultures these 
protems were eluted Irom the gel as described in Example 2 below. 

Ultral/nk Affinity Columns 
JO-20 mg of mpZ-IgG or CD4.|gG in PBS are coupled to 0.5 grams of Ultrallnk 
resin (Pierce) as described by the manufacturer's instructions. 

Construction and Expression of mpl-IgG 
A chimeric molecule comprising the entire extracellular domain of human mpl 
amino acids ,-491) and the Fc region of a human igQ, molecule was expressed In 
293 cells. A cDNA fragment encoding amino acids 1-491 of human mpl was obtained 
by PCR from a human megakaryocyte CMK cell cDNA library and sequenced A CM 
site was inserted at the 5' end and a BstE.I site at the 3' end. This fragment was cloned 
upstream of the IgG. Fc coding region in a Bluescript vector between the Cla. and the 
BstEII sites after partial digestion of the PCR product with BstEII because of two other 
BstEII sites present in the DNA encoding the extracellular domain of mpl. The BstEII 
site introduced at the ar end of the mpl PCR product was designed to have the Fc region 
in frame with the mpl extraocular domain. The construct was subcloned into P RK5- 
*neo vector between the CU and Xba. sites and transacted No 293 human embryonic 
kidney ceils by the calcium phosphate method. The ceils were seiected in 0.4 mg/ml 
G418 and individual clones were isolated. MpHgG expression from Isolated clones 
was determined using a human Fc specific EL.SA. The best expression done had an 
expression level of 1-2 mg/ml of mp/-lgG. 

Ba/F3 mpl P Expressing Cells 
A cDNA corresponding to the entire coding region of human mpl P was cloned 
■* P RK5-tkneo which was subsequently linearized with Noll and transfected into the 
IL-3 dependent ceif iine Ba/F3 by e.ectroporation (1 x 10? cells. 9605F. 250Vo.ts) 
Three days later selection was started in the presence of 2 mg/ml of G418. The cells 
were selected as pools or individual clones were obtained by limiting dilution in 96 
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well plates. Selected cells were maintained in RPMI containing 15% FBS, 1mg /ml 
G418, 20mM Glutamine, 10mM HEPES and 100 ug/mi of Pen-Strep. Expression of 
mpi P ,n selected clones was determined by FACS analysis using a anti-mp/ P rabbit 
polyclonal antibody. 

Ba/F3 mpi ligand Assay 
The mpi ligand assay was conducted as shown is Fig. 2. To determine the 
presence of mpi ligand from various sources, the mpi P Ba/F3 cells were starved of 
IL-3 for 24 hours at a cell density of 5 x 10* cells/ml in a humidified incubator at 
37'C in 5% C0 2 and air. Following IL-3 starvation the cells were plated out in 96 
well culture dishes at a density of 50,000 cells in 200 uj of media with or without 
diluted samples and cultured for 24 hrs in a cell culture incubator. 20 ul of serum 
free RPMI media containing 1 uCi of 3H-thymidine was added to each well for the last 
6-8 hours. The cells were then harvested on 96 well GF/C filter plates and washed 5 
times with water. The filters were counted in the presence of 40 m of scintillation 
1 5 fluid (Microscint 20) in a Packard Top Count counter. 
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EXAMPLE 2 
Highly Purified Porcine mpi Ligand 
Gel Elution Protocol 

Equal amounts of affinity purified mpi ligand (fraction 6 eluted from the mpl- 
IgG column) and 2X Laemmli sample buffer were mixed at room temperature without 
reducing agent and loaded onto a Novex 4-20% polyacrylamide gel as quickly as 
possible. The sample was not heated. As a control, sample buffer without ligand was 
run in an adjacent lane. The gel was run at 4-6»C at 135 volts for approximately 2 
1/4 hours. The running buffer was inftially at room temperature. The gel was then 
removed from the gel box and the plate on one side of the gel removed. 

A replica of the gel was made on nitrocellulose as follows: A piece of 
nitrocellulose was wet with distilled water and carefully laid on top of the exposed gel 
face so air bubbles were excluded. Fiducial marks were placed on the nitrocellulose 
and the gel plate so the replica could be accurately repositioned after staining. After 
approximately 2 minutes, the nitrocellulose was carefully removed, and the gel was 
wrapped in plastic wrap and placed in the refrigerator. The nitrocellulose was stained 
with Biorad's gold total protein stain by first agitating It in 3 x 10 ml 0.1% Tween 20 
+ 0.5 M NaCI + 0.1 M Trfs-HCI pH 7.5 over approximately 45 minutes followed by 3 
x 10 ml purified water over 5 minutes. The gold stain was then added and allowed to 
develop until the bands in the standards were visible. The replica was then rinsed with 
water, placed over the plastic wrap on the gel and carefully aligned with the fiducial 
marks. The positions of the Novex standards were marked on the gel plate and lines 
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were drawn to indicate the cutting positions. The nitrocellulose and plastic wrap were 
then removed and the gel cut along the indicated lines with a sharp razor blade. The 
cuts were extended beyond the sample lanes so they could be used to determine the 
positions of the slices when the gel was stained. After the slices were removed, the 
5 remaining gel was silver stained and the positions of the standards and the cut marks 
were measured, The molecular weights corresponding to the cut positions were 
determined from the Novex standards. 

The 12 gel slices were placed into the cells In two Biorad model 422 electro- 
eluters. 12-14K molecular weight cutoff membrane caps were used in the cells. 50 
1 0 mM ammonium bicarbonate + 0.05% SDS (approximately pH 7.8) was the elution 
buffer. One liter of buffer was chilled approximately 1 hour In a 4-6°C coldroom 
before use. Gel slices were eluted at 10 ma/cell (40 v initially) in a 4-6°C coldroom. 
Elution took approximately 4 hours. The cells were then carefully removed and the 
liquid above the frit removed with a pipet. The elution chamber was removed and any 

1 5 liquid above the membrane cap removed with a pipet. The liquid in the membrane cap 

was removed with a Pipetman and saved. Fifty u.| aliquots of purified water were then 
placed in the cap, agitated and removed until all the SDS crystals dissolved. These 
washes were combined with the saved liquid above. Total elution sample volume was 
300-500 |il per gel slice. Samples were placed in 10 mm Spectrapor 4 12-14K 

2 0 cutoff dialysis tubing which had been soaked several hours in purified water. They 

were dialyzed overnight at 4-6*C against 600 ml of phosphate buffered saline (PBS is 
approximately 4 mM in potassium) per 6 samples. The buffer was replaced the next 
morning and dialysis continued for 2.5 hours. Samples were then removed from the 
dialysis bags and placed in microfuge tubes. The tubes were placed on ice for 1 hour, 

2 5 microfuged at 14K rpm for 3 min. and the supematants carefully removed from the 

precipitated SDS. The supematants were then placed on ice for approximately 1 hour 
more and microfuged again for 4 min. The supematants were diluted in phosphate 
buffered saline and submitted for the activity assay. Remaining samples were frozen 
at -70°C. 

30 

EXAMPLE 3 
Porcine mpl Ligand Microsequenclng 
Fraction 6 (2.6 ml) from the /npMgG affinity column was concentrated on a 
Microcon-10 (Amicon). In order to prevent the mpl ligand from absorbing to the 

3 5 Microcon, the membrane was rinsed with 1% SDS and 5 uJ of 10 % SDS was added to 

fraction 6. Sample buffer (20 u.1) of 2X was added to the fraction #6 after Microcon 
concentration (20 u.l) and the total volume (40 uJ) was loaded on a single lane of a 4- 
20 % gradient acrylamide gel (Novex). The gel was run following Novex protocol. The 
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gel was than equi.ibrated for 5 mi n . pri0 r ,o electroblotting in 10 mM 
3-(c y clohexy.amino)-1-propanesul ( onic acid (CAPS) buffer, pH 11.0, containing 
10% methanol. E.ec.rob.olting onto Immobi.on-PSQ membranes (Mi.lipore) was 
earned out for 45 min. at 250 mA constant current in a BioRad Trans-Blot transfer 
cell (32). The PVDF membrane was stained with 0.1% Coomassie Blue R-250 in 40% 
methanol. 0.1% acetic acid for 1 min. and destained for 2-3 min. with 10% acetic acid 
in 50% methanol. The only proteins that were visible in the Mr 18,000-35 000 
region of the blot had Mr of 30,000. 28.000 and 22,000. 

Bands at 30, 28 and 22 kDa were subjected to protein sequencing. Automated 
protein sequencing was performed on a model 470A Applied Biosystem sequencer 
equipped with an on-line PTH analyzer. The sequencer was modified to Inject 80-90% 
of the sample (Rodriguez, J. Chromatogr., 350:217-225 [1985]). Acetone (-12 
ul/l) was added to solvent A to balance the UV absorbance. Electroblotted proteins 
were sequenced in the Blot, cartridge. Peaks were integrated with Justice Innovation 
software using Nelson Analytical 970 interfaces. Sequence interpretation was 
performed on a VAX 5900 (Henzel ef a/.. J. Chromatogr., 404:41-52 [1987]) N- 
terminal sequences (using one letter code with uncertain residues in parenthesis) and 
quantity of material obtained (in brackets) is presented in Table 2'. 



TABLE 2' 



■ — — """no'terminus sequence 

30 kDa [1.8 pmolj 

1 5 10 15 20 25 
(S)PAPPA(C)D PRLLNKLLRnn fH /S) V L H (G) R I 


s 


28 kDa [0.5 pmol] 

1 5 10 15 20 25 
(SLP APPAXDPRLLNKLLRDDfHWLMGR 


(SEQ ID NO: 30) 


18-22 kDa [0.5 pmol] 
1 5 10 
XPAPPAXDPRLX(NWK) 


(SEQ ID NO: 31) 
(SEQ ID NO: 32) 



EXAMPLE 4 

Liquid Suspension Megakaryoeytopolasla Assay 

Human peripheral stem cells (PSC) (obtained from consenting patients) were 
diluted 5 fold with IMDM media (Gibco, and centrifuged for 15 min. at room temp at 
800 x g. The cell pellets were resuspended in IMDM and layered onto 60% Percoll 
(density 1.077 gm/ml ) (Pharmacia) and centrifuged at 800 x g for 30 min The 
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1 5 



20 



25 



light density mononuclear cells were aenirat*H «♦«..._« 

15 were aspirated at the interface and washed 2x with 
"* P'ated out at 1-2 x 10 6 cells/ml in |MDM 3Q% ^ ^ 

voiume) in 24 we.l tissue culture clusters (Costar). APP or mpl ligand deleted APP 

TJcTw rn CU ' tUreS W6re 9r ° Wn '° r 12 - U d3yS h a h -' d « ied 'ncubator 

ai v, in 5 /, C0 2 and air. The cultures were also grown in the presence of 10% APP 

with 0.5 ng of mpMgG added at days 0. 2 and 4. APP was depleted of mpl ligand by 
passing APP through a mpAIgG affinity column. 

To quantitate megakaryocytopoiesis in these liquid suspension cultures a 
modification of Solberg et al. was used and employs a radiolabeled murine IgG 
monoclonal antibody (HP1-1D) to GPIIbllla (provided by Dr. Nichols. Mayo Clinic) 
100 ng of HP1-1D (see Grant, B. er al., Blood 69:1334-1339 [1987J) was 
radiolabeled with 1mCi of n.125, using Enzymobeads (Biorad. Richmond, CA) as 
described by the manufacturer's instructions. Radiolabeled HP1-1D was stored at 
■70-C in PBS containing 0.01% octyl-g.ucoside. Typical specific activfties were 1-2 
x 106 cpm/ug (>95% precipated by 12.5% trichloroacetic acid ). 

Liquid suspension cultures were set up in triplicate for each experimental 
point. After 12-14 days in culture the 1ml cultures were transferred to 1.5ml 
eppendorf tubes and centrifuged at 800 x g for 10 min. at room temp, and the resultant 
cell pellets were resuspended in 100 m of PBS containing 0.02% EDTA and 20% 
bovine calf serum. 10ng of 125 W 1-1D in 50 u. of assay buffer was added to the 
resuspended cultures and incubated for 60 min. at room temperature (RT) with 
occasional shaking. Subsequently, cells were collected by centrifugation at 800 x g for 
10 min. at RT and washed 2x with assay buffer. The pellet* were counted for 1 min in 
a gamma counter (Packard). Non-specific binding was determined by adding 1 ug of 
unlabeled HP1-1D for 60 min. before the addition of labeled HPMD Specific 
binding was determined as the total 125,. HP1 . 1D bound mjnus ^ ^ jn ft- 
presence of excess unlabeled HP1-1D. 



EXAMPLE 5 

30 Oligonucleotide PCR Printer* 

Based on the amino-terminal amino acid sequence obtained from the 30 kDa, 28 
kDa and 18-22 kDa proteins, degenerate oligonucleotides were designed for use as 
polymerase chain reaction (PCR) primers (see Table 4). Two primer pools were 
synthesized, a positive sense 20 mer pool encoding amino acid residues 2-8 (mpl 1) 
and an antl-sense 21-mer pool complimentary to sequences encoding amino acids 18- 
24 (mpl 2). 
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TABLE 4 

Regenerate Oligonucleoti de Primer Pools 



-i ^ .^^ rmnqt ro 

rnp/ 1 :y CCN GCN CCN CCN nru rr.v » , 2 04B ,, Q|d dftnan 

mn/ Mrr DT/i map, * __ L 



mp/ 2:5" NCC RTG NAR NAC RTG RTC RTP » (2 ,048. fo , d dpn.no^ 



(SEQ ID NO: 351 



(SEQ ID NO: 36) 



1 0 



Porcine genomic DNA. isolated from porcine peripheral blood lymphocytes, was 

Zl T 50 ^ rSaCti0n COn,ained: 0 8 « °« P«*» fl"nomlo 

DNA in 10mM Tris-HCI (pH 8.3). 50mM KC. 3mM MgCI 2 . 100 pg/ml BSA, 400 nM 

dNTPs. 1 uM of each primer pool and 2.5 units of Tap polymerase. Initial template 
denaturation was at 94°C for 8 min. followed by 35 cycles of 45 seconds at 94'C 1 
m.n. at 55'C and 1 min. at 72'C. The final cycle was allowed to extend for 10 min.' at 
72-C. PCR products were separated by electrophoresis on a 12% polyacrylamide gel 
and visualized by staining with ethidium bromide. It was reasoned that If the amino- 
terminal amino acid sequence was encoded by a single axon then the correct PCR 
product was expected to be 69 bp. A DNA fragment of this size was eluted from the gel 
and subcloned Into pGEMT (Promega). Sequences of three clones are shown below in 
1 5 Table 5. 

TABLE 5 

69 bp Porcine Genomic DNA Frag ment* 



gemT3 

s- ccAGCGCcqf; cascotc, T ga , cccccgactc ctaaataaac tgcctcgtga 

3'GGTCGCGGCG GTCGGACACT GGGGGCTGAG GATTTATTTG ACGGAGCACJ 

TGACCACGTT CAGCACGGC [69 bp] (SEQ ID NO: 37) 
ACTGGTG^AA GJCmnr.r.r, (SEQ |D NO; ^ 



gemT7 

S' CCAQCAOOTP OqqpATqTqA CCCCCGACTC CTAAATAAAC TGCTTCGTGA 
3'GGTCGTGGAG GCCGTACACT GGGGGCTGAG GATTTATTTG ACGAAGCACJ 

CGACCACGTC CATCACGGC [69 bp] (SEQ ID NO: 39) 

GCTGGTRHA» GTAGTf^nnr. (SEQ [D NQ; ^ 
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gemT9 

p RLLNKL L R (SEQ ID 

NO: 32) 

5 1 CCASCACCQCCQfiOATP>TfiA CCCCCGACTCCTAAATAAACTGCTTCGTGACG 
3' GGTCGTGGCGGCCGTACACTGGGGGCTGAGGATTTATTTGACGAAGCAdfiC 



ATCATGTCTATCACGGT 3' (SEQ ID NO: 41) 
TAGTACAftATArvr^A c f g EQ |p NQ . 4g) 



The position of the PCR primers is indicated by the underlined bases. These results 
verify the N-terminal sequence obtained for amino acids 9-17 for the 30 kDa, 28 kDa 
and 18-22 kDa proteins and Indicated that this sequence is encoded by a single exon of 
5 porcine DNA. 

EXAMPLE 6 
Human mpl Ugand Gene 
Based on the results from Example 5, a 45-mer deoxyoligonucleotide, called 
1 0 pR45, was designed and synthesized to screen a genomic library. The 45-mer had the 
following sequence: 

5' GCC-GTG-AAG-GAC-GTG-GTC-GTC-ACG-AAG-CAQ-TTT-ATT-TAG-GAG-TCQ 3" 

(SEQ ID NO: 28) 

This oligonucleotide was 32p.| abeled ^ (l 32 P) . ATP ^6 T4 Wnase ^ ^ 
15 to screen a human genomic DNA library in Xgem12 under low stringency hybridization 
and wash conditions (see Example 7). Positive clones were picked, plaque purified 
and analyzed by restriction mapping and southern blotting. Clone #4 was selected for 
additional analysis. 

A 2.8 kb BamHI-Xbal fragment that hybridized to the 45-mer was subcloned 
2 0 into pBluescript SK-. Partial DNA sequencing of this clone was preformed using as 
primers oligonucleotides specific to the porcine mpl Ugand DNA sequence. The 
sequence obtained confirmed that DNA encoding the human homolog of the porcine mpl 
ligand had been isolated. An EcoRI restriction site was detected in the sequence 
allowing us to isolate a 390 bp EcoRI-Xbal fragment from the 2.8 kb BamHI-Xbal and 
25 to subclone it in pBluescript SK-. 

Both strands of this fragment were sequenced. The human DNA sequence and 
deduced amino acid sequence are shown In Fig. 9 (SEQ ID NOS: 3 & 4). The predicted 
positions of introns in the genomic sequence are also indicated by arrows, and define a 
putative exon ("exon 3"). 
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1 0 



Exam^on of the predicted amino acid sequence confirms that a serine residue 
,s the f.rst amino acid of the mature mpl Hgand. as determined from direct amino acid 
sequence analysis. Immediately upstream from this codon the predicted amino acid 
sequence ,s highly suggestive of a signal sequence involved In secretion of the mature 
mpl Hgand. This signal sequence coding region is probably interrupted at nucleotide 
position 68 by an intron. 

In the 3' direction the exon appears to terminate at nucleotide 196. This axon 
therefore encodes a sequence of 42 amino acids, 16 of which are likely to be part of a 
s.gnal sequence and 26 of which are part of the mature human mpl ligand 



EXAMPLE 7 
Full Length Human mpl Ugand cDNA 
Based on the human "exon 3" sequence (Example 6) two non-degenerate 
oligonucleotides corresponding to the 3' and 5' ends of the "axon 3" sequence were 
1 5 synthesized (Table 6). 

TABLE 6 

Human cDNA Non-deaen«r a t» PC R oi| flon „ C | eot | d p Hmar . 



Fwd primer 5' GCT AGC TCT AGA AAT TGC TCC TCG TGG TCA TGC TTC T 3' 



(SEQ ID NO: 
43) 



Rvs primer: 5' CAG TCT GCC GTG AAG GAC ATG G 3' 



(SEQ ID NO: 
44) 



20 



25 



30 



These two primers were used in PCR reactions employing as a template DNA 
from various human cDNA libraries or 1 ng of Quick Clone cDNA (Clonetech) from 
various tissues using the conditions described in the Example 5. The expected size of 
the correct PCR product was 140 bp. After analysis of the PCR products on a 12% 
polyaaylamide gel, a DNA fragment of the expected size was detected in cDNA libraries 
prepared from adult kidney. 293 fetal kidney cells and cDNA prepared from human 
fetal liver (Clonetech cat. #7171-1). 

A fetal liver cDNA library in X DR2 (Clonetech cat. # HL1151x) was screened 
with the same 45 mer oligonucleotide used to screen the human genomic library The 
oligonucleotide was labelled with (^PJ-ATP using T4 polynucleotide kinase. The 
library was screened under low stringency hybridization conditions. The filters were 
prehybridlzed for 2hr then hybridized with the probe overnight at 42'C in 20% 
formamlde. SxSSC, 10xDenhardfs. O.OSM sodium phosphate (pH 6.5), 0.1% sodium 
pyrophosphate. 50 ug/ml of sonicated salmon sperm DNA for 16hr. Filters were then 
rinsed in 2xSSC and then washed once in 0.5xSSC. 0.1% SDS at 42"C Filters were 
exposed overnight to Kodak X-Ray film. Positive clones were picked, plaque purified 
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1 5 



and the insert size was determined hv ore , 

ueiermmea Dy PCR using oligonucleotides flanking the BamHI- 

Xbal cloning in X DR2 (Clonetech cat. #6475-1). 5 „ of phage stock was used as a 
template source. Initial denaturation was for 7 min. at 94°C followed by 30 cycles of 
amplification (1 min. at 94<C, 1 min. at 52*C and 1.5 min. at 72-Q. Final extention 
was for 15 min. at 72°C. Cone # FL2b had a 1.8kb insert and was selected for 
further analysis. 

The plasmid pDR2 (Clonetech. XDR2 & pDR2 cloning and Expression System 
Library Protocol Handbook, p 42) contained within the XDR2 phage arms, was rescued 
as described per manufacturer's instructions (Clonetech, XDR2 & P DR2 cloning and 
Expression System Library Protocol Handbook, p 29-30). Restriction analysis of the 
plasmid pDR2-FL2b with BamHI and Xbal indicated the presence of an internal BamHI 
restriction site in the insert approximately al position 650. Digestion of the plasmid 
with BamHI-Xbal cut the insert in two fragments, one of 0.65 kb and one of 1.15 kb. 
DNA sequence was determined with three different classes of template derived from the 
plasmid pDR2-FL2b. DNA sequencing of double-stranded plasmid DNA was carried out 
with the ABI373 (Applied Biosystems. Foster City, California) automated fluorescent 
DNA sequencer using standard protocols for dye-labeled dideoxy nucleoside 
triphosphate terminators (dye-terminators) and custom synthesized walking primers 
(Sanger et al., Proc. Natl. Acad. Sci. USA, 74:5463-5467 (1977); Smith et al 
20 Nature. 321:674-679 [1986]). Direct sequencing of polymerase chain reaction 
amplified fragments from the plasmid was done with the ABI373 sequencer using 
custom primers and dye-terminator reactions. Single stranded template was generated 
wilh the M13 Janus vector (DNASTAR. Inc.. Madison, Wisconsin) (Buriand at al 
Nucl. Acids Res., 21:3385-3390 [1993]). BamHI-Xbal (1.15 kb) and BamHI (0 65 
2 5 kb) fragments were isolated from the plasmid pDR2-FL2b, the ends filled in with T4 
DNA polymerase In the presence of deoxynucleotldes. and then subcloned into the Smal 
site of M13 Janus. Sequencing was carried out with standard protocols for dye-labeled 
M13 universal and reverse primers, or walking primers and dye-terminators 
Manual sequencing reactions were carried out on single strand M13 DNA using walking 
primers and standard dideoxy-terminator chemistry (Sanger et al., Proc. Natl Acad 
Sci. £,54.74:5463-5467 [1977]), 33p. labe(ed oWp ^ ^.^ 
States Biochemical Corp., Cleveland, Ohio). DNA sequence assembly was carried out 
with Sequencer V2.1b12 (Gene Codes Corporation. Ann Arbor. Michigan). The 
nucleotide and deduced sequences of hML are provided in Fig. 1 (SEQ ID NO: 1). 
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EXAMPLE 8 

isaaaon of the Human mpf Ugan(j Gen9 

Human genomic DNA clones of th« tdh 
genomic iibrary in X-Gem, 2 wi, h D T ™ ^ * * ^ 

5 under low stringency cond" 3 PreV '° USly deSCrib ° d 0ii ^^ »**. 

conditions with a rZn co ^ ?> °' ^ ^""^ 

mpl ligand (from h i C ° rreSP ° ndin9 ,0 ,he 3 ' ha » °< cDNA coding for the 

^zizz^v ,he 3,end) - tw ° ° ver,appin9 ,amMa 

containinotheenrrPO ne ,ra9men ' S ^ - E ^ 

9 entire TPO gene were subcloned and sequenced. The structure of the 
0 uman gene is composed of 6 exons wjthjn 7 kb Qf — ^ 

" ab r;;:; a " r n,ron iunc,ions are ™ - 

1 Ex n TT " neS (ShaPir °' M " B ' " NUC '- «** «~ «-™ 
[1987]). Exon 1 and exon 2 contain 5' untranslated sequence and the Initial four 

-no ac ,ds of the signal peptide. The remainder of the secretory J?JZ Z 

> 26 am.no ac.ds of the mature protein are encoded within exon 3. The entire 2b I 

.a, -d , untranslated as we„ as -50 amino acids of the erythropol^ 

doma,n are encoded within exon 6. The four amino acids invoked in the 

observed within hML-2 (hTPO-2) are encoded at the 5' end of exon 6. 

EXAMPLE 9 

i„ Jr,"'"l , e " re "'°" " """»» »»•»« wg 

lo o,«, ,o subclone ,„„ ,„„ glh ,„„.„ pDR2-FL2b tn„ 

pnr\o (pHK5-nmp/ I) see U.S. Patent No 5 2<5fl otrr ^ 

construct pRKS-hmp/ 1 was prepared by the PEG method and transfected in Hun™ 

ed) Vol. ... pp. ^-190, IRL Press, Washington D C) 36h ,» , . 

r s MrateHM — J - °^ r:r rrrc 
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EXAMPLE 10 
Human Mpl LIgand Isoforms 
hML2, hML3, and HML4 
In order to identify alternatively spliced forms of hML, primers were 
synthes.zed corresponding to each end of the coding sequence of hML. These primers 
were employed in RT-PCR to amplify human adutt liver RNA. Additionally, internal 
primers flanking selected regions of interest (see below) were constructed and 
s.m,larty employed. Direct sequencing of the ends of the PCR product revealed a single 
sequence corresponding exactly to the sequence of the cDNA isolated from the human 
fetal liver library (see Rg. 1 [SEQ ID N0: 1J} How(JV8r> a ^ ^ ^ ^ 

termmus of the EPO-domain (in .he middle of the PCR product) exhibited a complex 
sequence pattern sugesting the existence of possible splice variants in that region. To 
Isolate these splice variants, the primers provided in Table 7 flanking the region of 
1 5 interest were used in a PCR as templates for human adult liver cDNA. 



1 0 
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25 



30 



TABLE 7 



— uuiurm run Knmers 

phmpllcdna.3e1 : 5TGTGGACTTTAGCTTGGGAGAATG3' 


(SEQ ID NO: 45} 


.£bx4.f2: 5'GGTCCAGGGACCTGGAGGTTTG3' 


(SEQ ID NO: 46) 



35 



r SUUWUM,,U olu ™ wo mi 3. sequencing of individual 

subclones revealed the existence of at least 3 ML isoforms. One of them, hML (also 
refered to as HML332). is the longest form and corresponds exactly to the sequence 
iso«ated from the fetal liver library. Sequences of the four human mpl llgand Isoforms 
Lsted from longest (hML) to shortest (hML-4) are provided in (Fig. 11 [SEQ ID 
NOS: 6, 8, 9 4 10]). 

EXAMPLE 11 

Construction and Transient Expression of Human Mpl Llgand 
Isoforms and Substitutional Variants 

hML2, hML3, and hML(R1S3A, R1S4A) 

Isoforms hML2 and HML3 and substitutional variant hML(R153A R154A) 
were reconstituted from hML using the recombinant PCR technique described by 
Russell Higuchl, in PCR Protocols, A guide to Methods and AppHcatlons, Acad. Press 
MAInnis, D.H. Geffand, Jj. Sninsky & T.J. White Editors. 

in all contracts, the "outside" primers used are shown in Table 8 and the 
■overlapping' primers are shown in Table 0. 
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TABLE 8 



P"tslde Primers 


Cla.FL.F2: " — 




5'ATC GAT ATC GAT AGC CAG ACA CCC CGG CCA G3' 


(SEQ ID NO: 


HMPLL-fl: 


_ 47) 


5'GCT AGC TCT AGA CAG GGA AGG GAG CTG TAC ATG AGA3' 


(SEQ ID NO: 
48) 



h M L-2 : 



TABLE 9 
Overlappin g Primers 



MLA4.F: 
MU4.R 



5'CTC CTT GGA ACC CAG GGC AGG ACC 3' 
5'GGT CCT GCC CTG GGT TCC AAG GAG 3' 



(SEQ ID NO: 49) 



hML-3: 

hMLA116 + : 5'CTG CTC CGA GGA AAG GAC TTC TGG ATT 3' 
hMLA116-: 5'AAT CCA GAA GTC CTT TCC TCG fiAP. CAG 3. 



(SEQ ID NO: 51) 



RR-KO-F: 5'CCC TCT GCG TCG CGG CGG CCC CAC CCA C 3' 
RR-KO-R: 5'GTG GGT GGG GCC GCQ GCG ACQ CAP, Ann n »■ 



(SEQ ID NO: 53) 



1 0 



15 



... „ Q " — " " | ( SEQIDNO:54) | 

(StratanJ 7 a 7 ,i,i f Cations «» P^o-ed with cioned Plu DNA polymerase 
S.ratagene) us.ng ,he foi.owing condftions: Initial tempiate denaturation was at 
for 7 ™ foilowed by 30 cycles o, 1 mi, a, 94-c. 1 min. at 55'C and 1.5 min. at 
72 a The f,nal cycle was aHowed to extend for 10 min. at 72'C. The final PGR product 
was digested with Clal-Xba., gel purified and Coned in pRKSfcneo. 293 ce.,s were 
transfected with the various constructs as described above and the supernatant was 
assayed using the Ba/FS-mp, pro.iferat.on assay. hML-2 and hML-3 showed no 
detectable actMty ,n this assay, however the act«vi,y 0 , hML(R153A, R154A) was 
s.m,ar to hML Indicating that processing a, this di-basic site is not retired Z 
activity (see Fig. 13). 
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EXAMPLE 12 
Murine mpl Ugand cDNA 
mML, mML-2 and mML-3 

Isolation of mML cDNA. 

5 A DNA fragment corresponding to the entire coding region of the human mpl 

ligand was obtained by PCR, gel purified and labeled by random priming in the 
presence of 32p. dATP and 32 P . dCTP This probe was use<J tQ screep 1q6 c|Qnes Qf g 

mouse liver cDNA library in A.GT10 (Clontech cat# ML3001a). Duplicate filters 
were hybridized in 35% formamide, SxSSC, 10xDenhardfs, 0.1% SDS t 0.05M 
1 0 sodium phosphate (pH 6.5), 0.1% sodium pyrophosphate, 100 jig/ml of sonicated 
salmon sperm DNA overnight in the presence of the probe. Filters were rinsed in 
2xSSC and then washed once In O.SxSSC, 0.1% SDS at 42°C. Hybridizing phage were 
plaque-purified and the cDNA inserts were subcloned into the Eco R1 site of Bluescript 
SK- plasmid. Clone "LD" with a 1 .5 kb insert was chosen for further analysis and both 

1 5 strands were sequenced as descibed above for the human ML cDNA. The nucleotide and 

deduced amino acid sequences from clone LD are provided in Fig. 14 (SEQ ID NOS: 1 & 
11). The deduced mature ML sequence from this clone was 331 amino acid residues 
long and identified as 1T1ML331 (or mML-2 for reasons described below). 
Considerable identity for both nucleotide and deduced amino acid sequences were 

2 0 observed in the EPO-like domains of these ML's. However, when deduced amino acid 

sequences of human and mouse ML's were aligned, the mouse sequence appeared to have 
a tetrapeptide deletion between human residues 111-114 corresponding to the 12 
nucleotide deletion following nucleotide position 618 seen in both the human (see 
above) and pig (see below) cDNA's. Accordingly, additional clones were examined to 

2 5 detect possible murine ML isoforms. One clone, "L7\ had a 1.4 kb insert with a 335 

amino acid deduced sequence containing the "missing" tetrapeptide LPLQ. This form is 
believed to be the full length murine ML and is refered to as mML or mML33s. The 
nucleotide and deduced amino acid sequence for mML are provided in Fig. 16 (SEQ ID 
NOS: 12 & 13). Rnaliy, clone "L2 W was isolated and sequenced. This clone has the 
30 116 nucleotide deletion corresponding to hML3 and is therefore denominated mML-3. 
Comparison of the deduced amino acid sequences of these two isoforms is shown In Rg. 
16. 

Expression of recombinant mML Expression vectors for murine ML were 
prepared essentially as described in Example 8. Clones encoding mML and mML-2 

3 5 were subcloned into pRKStkneo, a mammalian expression vector that provides 

expression under the control of the CMV promoter and an SV40 polyadenylation signal. 
The resulting expression vectors, mMLpRKtkneo and mML2pRKtkneo were transiently 
transfected into 293 cells using the calcium phosphate method. Following transient 
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glucose DMEM med,a supplemented with 10% fetal calf serum 

express^TlT °' ^ *™ ^ Stabte "n- 

Z fo h " ° btainSd ^ ,ranSfeCti ° n °' ^ PRK,kne0 ' - 

^ i a,n,n9 en "' re C ° ding SGqU9nCe °' mUrine "*"< Stoda - «■ C er a/ 
™ 0 ,.12:2645-2653 f 1993]) was transfected into Ba/F3 ce.,s b 

electroporation (5 X io' cells. 250 volts. 960 uF) fo „ 0 wed by se.ection for 

neomycin, resistance with 2 mg/ml G418. Expression of mp, was assessed by flow 

cytometry analysis using rabbit anti-murine mpMgG antisera. Ba/F3 cells were 

ma,ntained in RPMI 1640 media from WEHI -3B cel.s as a source of IL-3 

Supernatants from 293 cel.s transiently transfected with both mML and mML-2 were 

assayed in BaF3 cells transfected with both mmp/ and hmp, as described in Example 
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EXAMPLE 13 
Porcine mpl Ligand cDNA 
pML and pML-2 

Porcine ML (pML) cDNA was isolated by RACE PGR. Briefly, an o.igo dT 
2 0 primer and 2 specific primers were designed based on the sequence of the exon of the 
porcne ML gene encoding the amino terminus of the ML purified from the aplastic pig 

PcTnuT ' r ° m Vari ° US 8P,aStiC Pi9 ^ WM ° bteined and ""*«-■ A 

Pr ° dUCt * 1342 * * as *"* Sidney and subdoned. Several clones were 
sequenced and found to encode the mature pig mpl ligand (no, including a comp.ete 
25 ^ The cDNA was found to encode a 332 amino acid mature protein 
(PML332) havmg the sequence shown in Fig. 18 (SEQ ID NOS: 9 & 16). 

Method: 

^JT* 0 " ^ 9en ° CDNA Gen ° m,C ° f ML ■« were 

■soiated by screening a pig genomic library in EMBL3 (Contech .nc.) with pR45 The 

.brary was screened essentiaiiy as described in Examp,. 7 . Severe. Cones were 
isolated and the exon encoding amino acid sequence identical to that obtained from the 
purged ML was sequenced. Porcine ML cDNA were obtained using a modlficatJon o, the 
RACE PCR protocol. Two specific ML primers were designed based on the sequence of 
the pig ML gene. Polyadenylated mRNA was isolated from the kidney of aplastic pigs 
essential^yas previously described. cDNA was prepared by reverse transcription 1 
tne BamdT primer 

(BamdT : 5' GACTCGAGGATCCATCGAI 1 1 1 1 1 1 1 1 j 1 1 1 1 ) 1 1 3.) 
(SEQ ID NO: 55) 
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directed against the polyadenosine tail of the mRNA. An initial round of PCR 
amplification (28 cycles of 95°C for 60 seconds, 58°C for 60 seconds, and 72»C for 
ninety seconds) was conducted using the ML specific h-forward-1 primer 

(h-forward-1: 5' GCTAGCTCTAGAAATTGCTCCTCGTGGTCATGCTTCT 3') 

(SEQ ID NO: 43) 

and the BAMAD primer 

(BAMAD: 5' GACTCGAGGATCCATCG 3') 
(SEQ ID NO: 56) 

in a 100 ml reaction (50 mM KCI. 1.5 mM MgCI, 10 mM Tris pH 8.0, 0.2 mM 
dNTPs.with 0.05 U/ml Amplitaq polymerase [Perkin Elmer Inc.]) The PCR product 
was then digested with Cla1, extracted with phenol-chloroform (1:1), ethanol 
precipitated, and ligated to 0.1 mg of Bluescript SK- vector (Stratagene inc.) that had 
been cut with Cla1 and Kpn 1. After incubation for two hours at room temperature, 
one fourth of the ligation mixture was added directly to a second round of PCR (22 

1 5 cycles as described above) using a second ML specific forward-1 primer 

(forward-1: 5' GCTAGCTCTAGAAGCCCGGCTCCTCCTGCCTG 3') 
(SEQ ID NO: 57) 

and T3-21 (an oligonucleotide that binds to a sequence adjacent to the multiple cloning 
region within the Bluescript SK- vector): 

2 0 (5' CGAAATTAACCCTCACTAAAG 3') 

(SEQ ID NO: 58). 

The resulting PCR product was digested with Xba1 and Clal and subclone* into 
Bluescript SK-. Several clones from indepedent PCR reactions were sequenced. 

Again, a second form, designated pML-2, encoding a protein with a 4 amino add 
residue deletion (328 amino acid residues) was identified (see Fig. 21 [SEQ ID NO: 
21]). Comparison of pML and pML-2 amino acid sequences shows the latter form is 
identical except that the tetrapeptide QLPP corresponding to residues 111-114 
inclusive have been deleted (see Fig. 22 [SEQ ID NOS: 18 & 21]). The four amino 
acid deletions observed in murine, human and porcine ML cDNA occur at precisely the 
3 0 same position within the predicted proteins. 

EXAMPLE 14 

CUK Assay for Thrombopoietln (TPO) Induction of Platelet Antigen 

GPIlbllla Expression 
CMK cells are maintained in RMPI 1640 medium (Sigma) supplemented with 
10% fetal bovine serum and 10mM glutamine. In preparation for the assay, the cells 
are harvested, washed and resuspended at 5x1 0* cells/ml in serum-free GIF medium 
supplemented with 5mg/l bovine insulin, I0mg/l apo-transferrin. 1 X trace 
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elements. In a 96-well flat-bottom Dlata the TPr> **^m 

. , piai9, tne TPO standard or experimental samples 

are added to each wel. at appropriate dilutions in 100 „■ volumes. 100 ,l of the CMK 
cel. suspens.cn ,s added to each we., and the plates are incubated at arc. in a 5% C0 2 
incubator tor 48 hours. After incubation, the plates are spun at lOOOrpm at 4'C for 
f.ve m,nutes. Supernatants are discarded and 100 u. of the FITC-conjugated GPII b ll| a 
monoclonal 2D2 antibody is added to each well. Following incubation at 4'C for 1 hour 
Plates are spun again a. lOOOrpm for five minutes. The supernatants containing 
unbound antibody are discarded and 200 m of 0.1% BSA-PBS wash is added to each 
well. The 0.1% BSA-PBS wash step is repeated three times. Cells are then analyzed 
on a FASCAN using standard one parameter analysis measuring relative fluorescence 
intensity. 
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EXAMPLE 15 

DAMI Assay for Thrombopoletin (TPO) by Measuring Endomltotlc 
Activity of DAMI Cells on 96-well Mlcrotiter Plates 
DAMI cells are maintained in IMDM + 10% horse serum (Gibco) supplemented 
wrth 10mM glutamine. lOOng/ml Penicillin G. and 50 ,xg/ml streptomycin In 
preparation for the assay, the cells are harvested, washed, and resuspended at 
1x106 ce .ls/ml in IMDM + 1% horse serum. In a 96-we.l round-bottom plate. 100 
Hi of the TPO standard or experimental samples is added to DAMI cell suspension Cells 
are then incubated for 48 hours at 37°C in a 5% C0 2 incubator. After incubation 
plates are spun in a Sorvall 6000B centrifuge at 1000rpm for five minutes at 4'C 
Supernatants are discarded and 200 m of PBS-0.1% BSA wash step Is repeated Cells 
are fixed by the addition of 200 nl ice-cold 70% Ethanol-PBS and resuspended by 
aspiration. After incubation at 4'C for 15 minutes, the plates are spun at 2000 rpm 
for five minutes and 150 ul of 1mg/ml RNAse containing 0.1mg/ml propidium iodide 
and 0.05% Tween-20 Is added to each well. Following a one hour incubation at 37'C 
the changes In DNA content are measured by flow cytometry. Polyploidy is measured 
and quantltated as follows: 

Normalized Polyploid Ratio (NPR, = (%Ce| ls in >Q ?+ M/%r all8 m m Jn 

(%Cells in >G2+M/%Cella in <G2+M) in control 
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EXAMPLE 16 
Thrombopoietin (TPO) In Vivo Assay 
(Mouse Platelet Rebound Assay) 
In Vivo Assay for 35 s Determlnatlon Qf pM&M Production 

C57BL6 mice (obtained from Charles River) are injected intraperitoneal^ 
(IP) with 1 ml goat anti-mouse platelet serum (6 amps) on day 1 to produce 
thrombocytopenia. On days 5 and 6. mice are given two IP injections of the factor or 
PBS as the control. On day 7, thirty ^Ci of Na 2 35 S0 4 in 0.1 ml saline . are injected 
intravenously and the percent 35 s incorporation of the injected dose into circulating 
platelets is measured in blood samples obtained from treated and control mice. Platelet 
counts and leukocyte counts are made at the same time from blood obtained from the 
retro-orbital sinus. 
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EXAMPLE 17 
KIRA ELISA for Thrombopoietin (TPO) 
by Measuring Phosphorylation of the mpl-Rse.gD Chimeric Receptor 
The human mpl receptor has been disclosed by Vigon et a/., PNAS, USA 
89:5640-5644 (1992). A chimeric receptor comprising the extracellular domain 
(ECD) of the mpl receptor and the transmembrane (TM) and intracellular domain 
(ICD) of Rse (Mark et a/., J. of Biol. Chem. 269(1 4): 10720- 10728 [1994]) with 
a carboxyl-terminal flag polypeptide (i.e. Rse.gD) was made for use in the KIRA ELISA 
described herein. See Fig. 30 and 31 for a diagrammatic description of the assay. 

(a) Capture agent preparation 

Monoclonal anti-gD (clone 5B6) was produced against a peptide from Herpes 
simplex virus glycoprotein D (Paborsky et a/., Protein Engineering 3(6):547-553 
[1990]). The purified stock preparation was adjusted to 3.0mg/ml in phosphate 
buffered saline (PBS), pH 7.4 and 1.0ml aliquots were stored at -20' C. 

(b) Anti-phosphotyrosine antibody preparation 

Monoclonal anti-phosphotyrosine, clone 4G10, was purchased from UBI (Lake 
Placid, NY) and biotlnylated using long-arm biotln-N-hydroxysuccinamide (Biotin- 
X-NHS, Research Organics, Cleveland, OH). 

(c) Ugand 

The mpl ligand was prepared by the recombinant techniques described herein. 
The purified mpl ligand was stored at 4 'C. as a stock solution. 
3 5 (d) Preparation of Rse.gD nucleic acid 

Synthetic double stranded oligonucleotides were used to reconstitute the coding 
sequence for the C-terminal 10 amino acids (880 - 890) of human Rse and add an 
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additional 21 amino acids containing an epitope for the antibody 5B6 and a stop codon. 
Tab.e 10 presents the final sequence of the synthetic portion of the fusion gene. 

TABLE 10 

Synthetic Dw „hi. c,.,,.^ Port|on pf Human Rsfl pua|Qn ^ 



coding strand: 

S'-TGCAGCAAGGGCTACTGCCACACTCGAGCTGCGCAGATGCTAGCCTCAAGA 
TGGCTG ATCCAAATCGATTCCGCGGCAAAGATCTTCCGGTCCTGTAGAAGCT-3 1 



noncoding (anti-sense) strand: 

5'-AGCTTCTACAGGACCGGAAGATCTTTGCCGCGGAATCGATTTGGATCAGCCA 
TCTTG AGGCTAGCATCTGCGCAGCTCGAGTGTGGCAGTAGCCCTTGCTGCA-3' 



(SEQ ID 
NO: 59) 



(SEQ 10 
NO: 60) 



The synthetic DNA was ligated with the cDNA encoding amino acids 1-880 of 
human Rse at the Pstl site beginning at nucleotide 2644 of the published human Rse 
cDNA sequence (Mark era/., Journal of Biological Chemistry 269(1 4)- 10720- 
10728 [1994]) and Hindlll sites in the polylinker of the expression vector 
PSVI7..D.LL (See Fig. 32 A-L; SEQ ID NO: 22) to create the expression plasmid 
pSV.ID.Rse.gD. Briefly, .he expression plasmid comprises a dicistronic primary 
transcnpt which contains sequence encoding DHFR bounded by 5' splice donor and 3' 
splice acceptor intron splice sites, followed by sequence that encodes the Rse.gD The 
full length (non-spliced) message contains DHFR as the first open reading frame and 
therefore generates DHFR protein to allow selection of stable transformarrts. 
(e) Preparation of mpl.Rse.gD nucleic acid 

The expression plasmid pSV.ID.Rse.gD produced as described above was modified 
to produce plasmid pSV.ID.M.tmRde which contained the coding sequences of the ECD of 
human mpi (amino acids 1-491) fused to the transmembrane domain and 
.ntracellular domain of Rse.gD (amino acids 429-911). Synthetic oligonucleotides 
were used to join the coding sequence of a portion of the extracellular domain of human 
mpi to a portion of the Rse coding sequence in a two step PCR cloning reaction as 
described by Mark et a/., J. Biol. Chem. 267:26166-26171 (1992). Primers used 
for the first PCR reaction were M1 

(5'-TCTCGCTACCGTTTACAG-3') 
(SEQ ID NO: 61) 

andM2 
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(5'-CAGGTACCCACCAGGCGGTCTCGGT-3') 
(SEQ ID NO: 62) 

with a mpl cDNA template and R1 

(5'-GGGCCATGACACTGTCAA-3') 
(SEQ ID NO: 63) 

and R2 

(5'-GACCGCCACCGAGACCGCCTGGTGGGTACCTGTGGTCCTT-3') 
(SEQ ID NO: 64) 

with a Rse cDNA template. The Pvull-Smal portion of this fusion junction was used for 
the construction of the full-length chimeric receptor. 
(f) Cell transformation 

DP12.CHO cells (EP 307,247 published 15 March 1989) were electroporated 
with pSV.ID.M.tmRdS which had been linearized at a unique Notl site in the pfasmid 
backbone. The DNA was ethanol precipitated after phenol/chloroform extraction and 
was resuspended in 20nl 1/10 Tris EDTA. Then, 10 M of DNA was incubated with 10? 
CHO DP12 cells in 1 ml of PBS on ice for 10 min. before electroporation at 400 vote 
and 330uf. Cells were returned to ice for 10 min. before being plated into non- 
selective medium. After 24 hours cells were fed nucleoside-free medium to select for 
stable DHFR+ clones. 

2 0 (9) Selection of transformed cells for use in the KIRA ELISA 

Clones expressing MPURse.gD were identified by western-blotting of whole 
cell lysates post-fractionation by SDS-PAGE using the antibody 5B6 which detects the 
gD epitope tag. 

(h) Media 

Cells were grown in F12/DMEM 50:50 (Gibco/BRL, Life Technologies. Grand 
Island, NY). The media was supplemented with 10% diafiltered FBS (HyClone, Logan, 
Utah), 25mM HEPES and 2mM L-glutamine. 

(i) KIRA ELISA 

MpARse.gD transformed DP12.CHO cells were seeded (3x1 0 4 per well) in the 
wells of a flat-bottom-96 well culture plate in 100 jii media and cultured overnight 
at 37 "C in 5% C0 2 . The following morning the well supematants were decanted, and 
the plates were lightly lamped on a paper towel. 50nl of media containing either 
experimental samples or 200. 50, 12.5, 3.12. 0.78, 0.19, 0.048 or 0 ng/ml mpl 
ligand was then added to each well. The cells were stimulated at 37*C for 30 min., the 
well supematants were decanted, and the plates were once again lightly tamped on a 
paper towel. To lyse the cells and solubilize the chimeric receptors, 100 ul of lysis 
buffer was added to each well. Lysis buffer consisted of 150 mM NaCI containing 50 
mM HEPES (Gibco), 0.5 % Trlton-X 100 (Gibco), 0.01 % thlmerosal, 30 KlU/ml 
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aprotinin (ICN Biochemicals, Aurora om i™m a 

' " urora ' OH), ImM 4-<2-aminoethyl -benzenesulfonyl 
fluonde hydrochloride (AEBSF; , CN Biochemical*), 50 uM leupeptin (ICN 
B,ochem 1 ca.s). and 2 mM sodium orthovanadate (Na 3 V0 4: Sigma CheZ, Co S " 
Lou,s, MO), pH 7.5. The plale was then agitated gently on a p.ate shaker (Belico 
Instruments. Vineland. NJ) .or 60 min. a. room temperature. 

While the cells were being solubilized. an ELISA microliter plate (Nunc 
Max.sor P , Inter Med. Denmark) coated overnight at 4'C with the 5B6 monoclonal antl- 
gD antibody (5.0 ug/ml in 50 mM carbonate buffer, pH 9.6, 100 ul/well) was 
decanted, tamped on a paper towel and blocked with 150 ul/well of Block Buffer [PBS 
containing 0.5 % BSA (Intergen Company, Purchase, NY) and 0.01 % thimerosal] for 
60 min. at room temperature with gentle agitation. After 60 minutes, the antl-gD 5B6 
coated plate was washed 6 times with wash buffer (PBS containing 0.05 % Tween-20 
and 0.01 % thimerosal) using an automated plate washer (ScanWasher 300, Skatron 
Instruments, Inc, Sterling, VA). 

The lysate containing solubilized MPL/Rse.gD from the cell-culture microtfter 
well was transferred (85 ul/well) to anti-gD 5B6 coated and blocked ELISA well and 
was .ncubated for 2 h at room temperature with gentle agtetion. The unbound mpl- 
Rse.gD was removed by washing with wash buffer and 100 ul of biotinylated 4Q10 
(ant.-phosphotyrosine) diluted 1:18000 in dilution buffer (PBS containing 0 5 % 
BSA. 0.05 % Tween-20, 5 mM EDTA. and 0.01 % thimerosal), i.e. 56 ng/m. was 
added to each well. After incubation for 2 hr at room temperature the plate was washed 
and 100 ul of horseradish peroxidase (HRPO)-conjugated streptavldln (Zymed 
Laboratories, S. San Francisco. CA) diluted 1:60000 in dilution buffer was added to 
each well. The plate was incubated for 30 minutes at room temperature wtth gentle 
agnation. The free avidin-conjugate was washed away and 100 ul freshly prepared 
substrate solution (tetramethyl benzidine [TMB]; 2-component substrate kit 
Klrkegaard and Perry. Gaithersburg, MD) was added to each well. The reaction was 
allowed to proceed for 10 minutes, after which the color development was stopped by 
the addmon of 100 ul/Wei. 1.0 M H 3 P0 4 . The absorbance at 450 nm was rea^with a 
reference wavelength of 650 nm (ABS 450/650 ), U8ing a vmax p(ate ^ 
(Molecular Devices, Palo Alto. CA) controlled with a Macintosh Centris 650 (Apple 
Computers. Cupertino. CA) and DeltaSoft software (BioMetaJlica. Inc, Princeton. NJ) 

The standard curve was generated by stimulating dp12.trkA,B or C.gD cells 
with 200 50. 12.5, 3.12. 0.78, 0.19. 0.048 or 0 ng/m. mpl „gand and presented as 
ng/ml TPO vs. mean ABS 450 /650 ± sd using the DeltaSoft program. Sample 
concentrations were obtained by interpolation of their absorbance on the standard 
curve and are expressed in terms of ng/ml TPO activity. 
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The mpMigand was found to be able ,o activate the mp/.Rse.gD chimeric 
receptor in a concentration-dependent and ligand-specific manner. Further, the mpA 
T* f KIRA-ELISA was found to be to.eran. of up to 100% human serum (shown, or 
100/. plasma (not shown,, allowing the assay to be used to readiiy screen patient and 

5 pK samples. 
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EXAMPLE 18 

flecepfor Based ELISA for Thrombopotetln (TPO) 

P ,a,es ™>re coated with rabbit F(ab') 2 anti-human IgG (Fc ) in pH 9 6 
carbonate buffer at 4'C overnight. Plates were b.ocked with 0.5 % bovine serum 
albumin m PBS a. room temperature for one hour. Fermenter harvest containing the 
chimeric receptor. mpAIgG, was added to the plates and incubated for 2 hours. Twofold 
senal d.lutions (0.39-25 ng/ml) of the standard (TP0 33 2 produced in 293 cells 
with the concentration determined by quantitative amino acid analysis) and serially 
d.luted samples in 0.5-/. bovine serum albumin. 0.05% tween 20 were added to the 
plates and incubated for 2 hours. Bound TPO was detected with protein A purified, 
biotmyla.ed rabbit antibodies to TP0 155 which was produced In E. coll (1 hour 
incubation), followed by streptavidin-peroxidase (30 min. incubation) and 
3.3,5,5'-tetramethyl benzidine as the substrate. The absorbance was read at 450 nm 
Plates were washed between steps. For data analysis, the standard curve is fitted using 
a four-parameter curve fitting program by Kaleidagraph. Concentrations of the 
samples were calculated from the standard curve. 



20 



EXAMPLE 19 

Expression and Purification of TPO from 293 Cells 

1. Preperation of 293 Cell Expression Vectors 

A.cDNA corresponding to the TPO entire open reading frame was obtained by 
PCR using the following oligonucleotides as primers: 

TABLE 11 



— ■ rwn rrimers 

Cla.FLF: 5" ATC GAT ATC GAT CAG CCA GAC ACC CCGGCCAG 3 1 


(SEQ ID NO: 
65) 


hmotl-R: 5' GCT AGC TCT AGA CAG GGA AGG GAG CTG TAC ATG AGA 3' 


(SEQ ID NO: 
48) 
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r , „ K , H „, waa U8eg flS , emp | ate fQr ^ reacts 

in the presence of pfu DNA polymerase (Stratagene). Initial denaturation was for 7 
mm. at 94-C followed by 25 cycles of amplification (1 min. at 94-C. 1 min at 55 « C 
and 1 min. at 72-C). Rnal extension was for 15 min. at 72'C). The PCR product was 
purified and cloned between the restriction sites Clal and Xbal of the plasmid 
pRKStkneo. a P RK5 derived vector modified to express a neomycin resistance gene 
under the control of the thymidine kinase promote, to obtain the vector 
pRK5tkneo.ORF. A second construct corresponding to the epo homologous domain was 
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generated the same way but using Cla.FL.F as forward primer and the following reverse 
primer: 

Arg.STOP.Xba: 5' TCT AGA TCT AGA TCA CCT GAC GCA GAG GGT GGA CC 3* 

(SEQ ID NO: 66) 

The final construct is called pRK5-tkneoEPO-D. The sequence of both constructs was 
venfied as described in Example 7. 
2. Transfection of Human Embryonic Kidney cells 

These 2 constructs were transfected into Human Embryonic Kidney cells by the 
CaP0 4 method as described in Example 9. 24 hours after transfection selection of 
neomycin resistant clones was started in the presence of 0.4 mg/ml G418.10 to 15 
days later individual colonies were transferred to 96 well plates and allowed to grow to 
confluence Expression of ML 153 or ML332 in the conditioned media from these 
clones was assessed using the Ba/F3-mp/ proliferation assay (described in Example 
I). 

1 5 3. Purification of rhML 332 

293-rhML 332 conditioned media was applied to a Blue-Sepharose 
(Pharmacia) column that was equilibrated in 10mM sodium phosphate pH 7.4 (buffer 
A). The column was subsequently washed wtth 10 column volumes each of buffer A and 
buffer A containing 2M urea. The column was then eluted with buffer A containing 2M 
urea and 1M NaCI. The Blue-Sepharose elution pool was then directly applied to a 
WGA-Sepharose column equilibrated In buffer A. The WGA-Sepharose column was 
then washed with 10 column volumes of buffer A containing 2M urea and 1 M NaCI and 
eluted with the same buffer containing 0.5M N-acetyl-D-glucosamine. The WGA- 
Sepharose eluate was applied to a C4-HPLC column (Synchrom. Inc.) equilibrated in 
0.1% TFA. The C4-HPLC column was eluted with discontinuous propanol gradient (0- 
25%. 25-35%. 35-70%). rhML 332 was found to elute in the 28-30% propanol 
region of the gradient. By SDS-PAGE the purified mMI.332 migrates as a broad band 
in the 68-80 kDa region of the gel(see Figure 15). 
4. Purification of mML 153 

293-rhMLi53 conditioned media was resolved on Blue-Sepharose as described 
for rhML 33 2. The Blue Sepharose eluate was applied directly to a mp/-afflnlty 
column as described above. RhML.153 eluted from the mp/-affinity column was 
purified to homogeneity using a C4-HPLC column run under the same conditions as 
described for rhML3 32 . By SDS-PAGE the purified rhMLi 53 resolves into 2 major 
and 2 minor bands with Mr of -1 8.000-21 ,000(see Figure 15). 
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EXAMPLE 20 
Expression and Purification of TPO from CHO 

1. Description of CHO Expression Vectors 

The expression vectors used in the electroporation protocols described below 
nave been designated: 

pSVIS.ID.LL.MLORF (full length or hTP0 33 2). and 

PSVI5.ID.LLMLEPO-D (truncated or hTP0 153 ). 
The pertinent features of these plasmids are presented in Rg. 23 and 24. 
2. Preparation of CHO Expression Vectors 

A cDNA corresponding to the hTPO entire open reading frame was obtained by 
PCR using the oligonucleotide primers of Table 12. 

TABLE 12 



■ — vector i<uh Primers 

Cla.FL.F2 5' ATC GAT ATC GAT AGC CAG ACA CCC CGG CCA G 3" 


(SEQIDNO: 


ORF. Sal 5" AGT CGA CGT CGA CGT CGG CAG TGT CTG AGA ACC 3* 


47) 
(SEQ ID NO: 
67) 



r — ... K ,„ , ang a; was used as template for the 

reaction in the presence of pfu DNA polymerase (Stratagene). Initial denaturation was 
for 7 min. at 94°C followed by 25 cycles of amplification (1 min. at 94-C, 1 min at 
55°C and 1 min. at 72°C). Final extension was for 15 min. at 72'C) The PCR 
product was purified and cloned between the restriction sites Clal and Sail of the 
plasmid pSVl5.ID.LL to obtain the vector pSVIS.ID.LL.MLORF. A second construct 
corresponding to the EPO homologous domain was generated the same way but using 
Cla.FL.F2 as forward primer and the following reverse primer 

EPOD.Sal 5- AGT CGA CGT CGA CTC ACC TGA CGC AGA GGG TGG ACC 3" 

(SEQ ID NO: 68) 

The final construct is called pSVI5.ID.LLMLEPO-D. The sequence of both constructs 
was verified as described in Example 7 and 9. 

In essence, the coding sequences for the full length and truncated ligand were 
•ntroduced into the multiple cloning site of the CHO expression vector P SVI5 ID LL 
This vector contains the SV40 early promoter/enhancer region, a modified splice unit 
containing the mouse DHFR cDNA. a multiple cloning site for .he Introduction of the 
gene of interest (In this case the TPO sequences described) an SV40 polyadenylation 
s.gnal and origin of replication and the beta-lactamase gene for plasmid selection and 
amplification in bacteria. 
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3. Methodology for Establishing Stable CHO Cell Lines Expressing Recombinant 
Human TPO332 and TPO1S3 

a. Description of CHO parent cell line 

The host CHO (Chinese Hamster Ovary) cell line used for the expression of the 
TPO molecules described herein is known as CHO-DP12 (see EP 307.247 published 
15 March 1989). This mammalian cell line was clonally selected from a transfection 
of the parent line (CHO-K1 DUX-B1 1(DHFR-)- obtained from Dr. Frank Lee of 
Stanford University with the permission of Dr.L. Chasin) with a vector expressing 
preproinsulin to obtain clones with reduced insulin requirements. These cells are also 
DHFR minus and clones can be selected for the presence of DHFR cDNA vector sequences 
by growth on medium devoid of nucleoside supplements (glycine, hypoxanthine, and 
thymidine). This selection system for stably expressing CHO cell lines is commonly 
used. 

b. Transfection method (electroporatlon) 

TPO332 and TPO153 expressing cell lines were generated by transfecting 
DP12 cells via electroporatlon (see e.g. Andreason, G.L J. Tiss. Cult. Meth., 15,56 
[1993]) with linearized pSVI5.ID.LL.MLORF or pSVI5.ID.LL.MLEPO-D plasmids 
respectively. Three (3) restriction enzyme reaction mixtures were set up for each 
plasmid cutting; 10ug, 25ng and 50ug of the vector with the enzyme NOTI by standard 
molecular biology methods. This restriction site is found only once In the vector in the 
linearization region 3' and outside the TPO ligand transcription units (see Fig. 23). 
The lOOjJ reactions were set up for overnight incubation at 37 degrees. The next day 
the mixes were phenol-chloroform-isoamyl alcohol (50:49:1) extracted one time and 
ethanol precipitated on dry ice for approximately one hour. The precipitate was then 
collected by a 15 minute microcentrifugation and dried. The linearized DNA was 
resuspended into 50ul of Ham's DMEM-F12 1:1 medium supplemented with standard 
antibiotics and 2mM glutamine. 

Suspension growing DP12 cells were collected, washed one time In the medium 
described for resuspending the DNA and finally resuspended in' the same medium at a 
0 concentration of 10? cells per 750nl. Aliquots of cells (750ul) and each linearized 
DNA mix were incubated together at room temperature for one hour and then 
transferred to a BRL electroporatlon chamber. Each reaction mix was then 
electroporated in a standard BRL electroporatlon apparatus at 350 volts set at 330uF 
and low capacitance. After electroporation, the cells were allowed to sit in the 
5 apparatus for 5 minutes and then on ice for an additional 10 minute incubation period. 
The electroporated cells were transferred to 60mm cell culture dishes containing 5 ml 
of standard, complete growth medium for CHO cells (High glucose DMEM-F12 50:50 
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without glycine supplemented with 1X nur o m »« . . 

«u mm u qht, 2mM glutamine, and 5% fetal calf serum) 
and grown overnight in a 5% C0 2 cell culture incubator. 
c. Selection and screening method 

The next day, cells were trypsinized off the plates by standard methods and 
transferred to 150mm tissue culture dishes containing DHFR selective medium (Ham's 
DMEM-F12, 1:1 medium described above supplemented with either 2% or 5% dialyzed 
fetal calf serum but devoid of glycine, hypoxanthine and thymidine this is the standard 
DHFR selection medium we use). Cells from each 60mm dish were subsequently 
replated into 5 /150 mm dishes. Cells were then incubated for 10 to 15 days( with 
one medium change) at 37 degrees/5% C0 2 until clones began to appear and reached 
sizes amenable to transfer to 96 well dishes. Over a period of 4-5 days, cell lines 
were transferred to 96 well dishes using sterile yellow tips on a pipettman set at 
50ml. The cells were allowed to grow to confluency (usually 3-5 days) and then the 
trays were trypsinized and 2 copies of the original tray were reproduced. Two of these 
copies were short term stored in the freezer with cells in each well diluted into 50ul 
of 10%FCS in DMSO. 5 day conditioned serum free medium samples were assayed from 
confluent wells in the third tray for TPO expression via the Ba/F cell based activity 
assay. The highest expressing clones based on this assay were revived from storage 
and scaled up to 2 confluent 150mm T-flasks for transfer to the cell culture group for 
2 0 suspension adaptation, re-assay and banking. 
d. Amplification Protocol 

Several of the highest titer cell lines from the selection described above were 
subsequently put through a standard methotrexate amplification regime to generate 
h,gher titer clones. CHO cell clones are expanded and plated in 10cm dishes at 4 
concentrations of methotrexate (/.... 50nM. lOOnM. 200nM and 400nM) at two or 
three cell numbers (105. 5x105. and 106 cells per dish). These cuftures are then 
-ncubated at 37 degree/5% C0 2 until clones are established and amenable to transfer 
to 96 well dishes for further assay. Several high titer clones from this selection were 
again subjected to greater concentrations of methotrexate (/... 600nM 800 nM 
1000nM and 1200nM) and as before resistant clones are allowed to establish and then 
transferred to 96 well dishes and assayed. 

4. Culturing Stable CHO Cell Lines Expressing Recombinant Human TP0 332 and 
TP01S3 

Banked cells are thawed and the cell population is expanded by standard cell 
growth methods in either serum free or serum containing medium. After expansion to 
sufficient cell density, cells are washed to remove spent cell culture media. Cells are 
then cultured by any standard method including; batch, fed-batch or continuous 
culture at 25-40 -C, neutral. pH. with a dissolved 0 2 content of at least 5% until the 
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constitutively secreted TPO is accumulated. Cell culture fluid is then separated from 

the cells by mechanical means such as centrifugation. 

5 Purification of Recombinant Human TPO from CHO Culture Ffuids 

Harvested cell culture fluid (HCCF) is directly applied to a Blue Sepharose 6 
5 Fast Flow column (Pharmacia) equilibrated in 0.01M Na Phosphate pH7.4, 0.1 5M NaCI 
at a ratio of approximately 100L of HCCF per liter of resin and at a linear flow rate of 
approximately 300 ml/hr/cm*. The column is then washed with 3 to 5 column 
volumes of equilibration buffer followed by 3 to 5 column volumes of 0.01 M Na 
Phosphate pH7.4, 2.0M urea. The TPO is then eluted with 3 to 5 column volumes of 

1 0 0.01 M Na Phosphate pH7.4, 2.0M urea, 1.0M NaCI. 

The Blue Sepharose Pool containing TPO is then applied to a Wheat Germ Lectin 
Sepharose 6MB column (Pharmacia) equilibrated in 0.01 M Na Phosphate pH7.4 f 
2.0M urea, and 1.0M NaCI at a ratio of from 8 to 16 ml of Blue Sepharose Pool per ml 
of resin at flow rate of approximately 50 ml/hr/cm*. The column is then washed with 

15 2 to 3 column volumes of equilibration buffer. The TPO is then eluted with 2 to 5 
column volumes of 0.01 M Na Phosphate pH7.4, 2.0M urea, 0.5M N-acetyl-D- 
glucosamine. 

The Wheat Germ Lectin Pool is then adjusted to a final concentration of 0.04% 
C12E8 and 0.1% trifluroacetic acid (TFA). The resulting pool is applied to a C4 
reverse phase column (Vydac 214TP1022) equilibrated in 0.1% TFA, 0.04% C12E8 
at a load. of approximately 0.2 to 0.5 mg protein per ml of resin at a flow rate of 157 
ml/hr/cm2. 

The protein is eluted in a two phase linear gradient of acetonitrile containing 
0.1% TFA, 0.04% C12E8. The first phase is composed of a linear gradient from 0 to 
30% acetonitrile in 15 minutes, The second phase is composed of a linear gradient 
from 30 to 60% acetonitrile in 60 minutes. The TPO elutes at approximately 50% 
acetonitrile. A pool is made on the basis of SDS-PAGE. 

The C4 Pool is then diluted with 2 volumes of 0.01 M Na Phosphate pH7.4 t 
0.1 5M NaCI and diafilitered versus approximately 6 volumes of 0.01 M Na Phosphate 
PH7.4. 0.15M NaCI on an Amicon YM or like ultrafiltration membrane having a 
10,000 to 30,000 Dalton molecular weight cut-off. The resulting diafiltrate may be 
then directly processed or further concentrated by ultrafiltration. The 
diafiltrate/concentrate is adjusted to a final concentration of 0.01% Tween-80. 

All or a portion of the diafiltrate/concentrate equivalent to 2 to 5% of the 
calculated column volume is then applied to a Sephacryl S-300 HR column 
(Pharmacia) equilibrated in 0.01 M Na Phosphate pH7.4, 0.1 5M NaCI, 0.01% Tween- 
80 and chromatographed at a flow rate of approximately 17 ml/hr/cm 2 . The TPO 
containing fractions which are free of aggregate and proteolytic degradation products 

-124- 



20 



25 



30 



35 



WO 95/1885* 



PCT/US94/14553 



are pooled on the basis of SDS-PAGP Tha ..• 

ow AGE - The result| ng Pool Is filtered on a 0.22u filter, 

Millex-GV or like, and stored at 2-8°C. 

EXAMPLE 21 

Transforms and , nd uct,on of TPO Protein Synthesis ,n E. coU 

1 . Construction of E. coli TPO expression vectors 

The ptasmids p MP21 , pMPlsi. pM P4i. pMP57 and pMP202 are all designed 

to express the first 155 amino acids of TPO downstream of a smal. leader which varies 
among the different constructs. The leaders provide primarily for high level 
translation initiation and rapid purification. The plasmlds pMP210-1, -T8, -21 
-22, -24. -25 are designed to express the first 153 amino acids of TPO downstream 
of an initiation methionine and differ only in the codon usage for the first 6 amino acids 
of TPO, while the plasmid P MP251 is a derivative of pMP210-1 in which the carboxy 
terminal end of TPO is extended by two amino acids. All of the above plasmlds will 
produce high levels of intracellular expression of TPO in E. co.i upon induction of the 
tryptophan promoter (Yansura, D. G. at. a,. Methods In Enzywology ( Goeddel D V 
Ed.) 185:54-60, Academic Press, San Diego [1990]). The plasmlds P MP1 and 
PMP172 are intermediates in the construction of the above TPO intracellular 
expression plasmids. 

(a) Plasmid pMP1 

The plasmid P MP1 Is a secretion vector for the first 155 amino acids of TPO 
and was constructed by ligating together 5 fragments of DMA as shown in Fig. 33 The 
f.rst of these was the vector P Pho21 In which the small Mlul-BamHI fragment had 
been removed. P Pho21 is a derivative of P hGH1 (Chang, C. N. at al., Gane 55189- 
196 [1987]) in which the human growth hormone gene has been replaced with the E 
coh phoA gene, and a Mlul restriction site has been engineered Into the coding sequence 
for the STII signal sequence at amino acids 20-21. 

The next two fragments, a 258 base pair Hinfl-Pstl piece of DNA from pRK5- 
^/.(Examp.e 9) encoding TPO amino acids 19-103. and the following synthetic 
DNA encoding amino acids 1-18 



5*- 



TG 



^GCQTATGCCAQCCCGGCTCCTCCTGCTTGTGACCTCCGAGTCCTC^ 



ATACGGTCGGGCCGAGGAGGACGMCACT^ 

ACTGA-5' 

(SEQ ID NO: 69) 
(SEQ ID NO: 70) 
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were preligated with T4-DNA ligase, and second cut with Pstl. The fourth was a 152 
base pair Pstl-Haelll fragment from pRKShmpll encoding amino acids 104-155 of 
TPO. The last was a 412 base pair Stul-BamHI fragment from P dh108 containing the 
lambda to transcriptional terminator as previously described (Scholtissek, S. et al 
NAR 15:3185 [1987]). 

(b) Plasmid pMP2t 

The plasmid pMP21 is designed to express the first 155 amino acids of TPO 
with the aid of a 13 amino acid leader comprising part of the STII signal sequence. It 
was constructed by ligating together three (3) DNA fragments as shown in Fig. 34, 
the first of these being the vector pVEG31 in which the small Xbal-Sphl fragment had 
been removed. The vector pVEG31 is a derivative of pHGH207-1 (de Boer, H. A. et. al. 
, in Promoter Structure and Function (Rodriguez, R. L. and Chamberlain. M. J. , Ed), 
462, Praeger. New York [1982]) in which the human growth hormone gene has been 
replaced by the gene for vascular endothelial growth factor ( this identical vector 
1 5 fragment can be obtained from this latter plasmid). 

The second part in the ligation was a synthetic DNA duplex with the following 
sequence: 



1 0 



20 



5'- CTAG AATTATGAAAAAGAATATCGCATTTCTTCTTAA 

TTAATACTI 1 1 ICI rATAGCGTAAAGAAGAATTGCGC-5 
(SEQ ID NO: 71) 
(SEQ ID NO: 72) 

The last piece was a 1072 base pair Mlul-Sphl fragment from pMP1 encoding 155 
amino acids of TPO. 
25 (c) Plasmid pMP1 51 

The plasmid pMP151 Is designed to express the first 155 amino acids of TPO 
downstream of a leader comprising 7 amino acids of the STII signal sequence. 8 
hlstidines, and a factor Xa cleavage site. As shown in Fig. 35, pMP151 was 
constructed by ligating together three DNA fragments, the first of these being the 
previously described vector pVEG31 from which the small Xbal-Sphl fragment had 
been removed. The second was a synthetic DNA duplex with the following sequence: 



30 



5-CTAGMmTGAAAAA<^TATCGCATTTCATCACCATCACCATCACCATCACATCGAAG 
GTCGTAGCC 

3 5 TTMTACTTTTTCTTATAGCGTAMGTAGTGGTAGTGGTAGTGGTAGTGTAGiCTTC 
CAGCAT-5' 

(SEQ ID NO: 73) 
(SEQ ID NO: 74) 
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acids of TPO P3ir BQ "" SPhl ,ra9ment ' r0m PMP11 encodi "9 154 amino 

acids of TPO. The plasmid P MP11 is identica( t0 pMPl W| , h ^ n Qf 

codon changes in the STII signai sequence this fragment can be obtained from P MP1> 
( a) Plasmid pMP202 
5 The plasmid P MP202 is very similar ,o the expression vector P MP151 with 

the exception that the factor Xa cleavage site in the leader has been replaced with a 
thrombin cleavage site. As shown in Fig. 36, P MP202 was constructed by .igating 
together three DNA fragments. The first of these was the previously described P VEG31 
in which the sma.i Xba,-S P h. fragment had been removed. The second was a synthetic 
1 0 DNA duplex with the following sequence: 

S'-CTAGAATTATGAAAAAGAATATCGCATTTCATCACCATCACCATCACCATCACATCGAA 
CCACGTAGCC 

TTMTACTTTTTCTTATAGCGTAAAGTAGTGGTAGTGGTAGTGGTAGTGTAGCTT 
1 5 GGTGCAT-5' 

(SEQ ID NO: 75) 
(SEQ ID NO: 76) 

The last piece was a 1064 base pair Bgl.-Sphl fragment from the previous.y described 

plasmid pMP1l. 

20 (e) Plasmid pMP1 72 

The plasmid p M P172 is a secretion vector for the first 153 amino acids of 
TPO, and is an intermediate for the construction of pMP210. As shown in Fig 37 
PMP172 was prepared by ligating together three DNA fragments, the firs, of which 
was the vector P LS32lamB in which the small EcoRI-Hindlll section had been removed 
The second was a 946 base pair EcoR.-Hgal fragment from the previously described 
Plasmid PMP11. The last piece was a synthetic DNA dupiex with the following 
sequence: 

S'-TCCACCCTCTGCGTCAGGT (SE Q ID NO: 77) 

GGAGACGCAGTCCATCGA-S' (SEQ ID NO- 78) 

30 (t) Plasmid pMP210 

The plasmid pMP210 Is designed to express the first 153 amino acids of TPO 
after a translation^ initiation methionine. This plasmid was actually made as a bank 
of plasmlds in which the first 6 codona of TPO were randomized in the third position of 
each codon. and was constructed as shown in Fig. 38 by the ligation of three DNA 
fragments. The first of these was the previously described vector pVEG31 in which the 
small Xba.-Sphl fragment had been removed. The second was a synthetic DNA dupex 
shown below treated first with DNA polymerase. (Klenow) foliowed by digestion with 



25 



35 



-127- 



WO 95/18858 



PCT/US94/14553 



1 5 



20 



25 



30 



Xbal and Hinfl, and encoding the initation methionine and the randomized first 6 codons 
of TPO. 

5'-GCAGCAGTTCTAGMTTATGTCNCCNGCNCCNCCNGCNTGTGiACCTCCGA 

ACACTGGAGGCT 

GTTCTCAGTAAA (SEQ ID NO: 79) 

CAAGAGTCATTTGACGAAGCACTGAGGGTACAGGAAG-5- (SEQ ID NO: 80) 

The third was a 890 base pair Hinfl-Sphl fragment from pMP172 encoding amino 
1 0 acids 19-153 of TPO. 

The plasmid pMP210 bank of approximately 3700 clones was retransformed 
onto high tetracycline (50 ng/ml) LB plates to select out high translational initiation 
clones (Yansura. D. G. et. a,., Methods: A Companion to Methods in Enzymology 4:151- 
158 [1992]). Of the 8 colonies which came up on high tetracycline plates, five of the 
best in terms of TPO expression were subject to DNA sequencing and the results are 
shown in Fig. 39 (SEQ ID NOS: 23, 24, 25, 26, 27 and 28). 
(g) Plasmid pMP41 

The plasmid P MP41 is designed to express the first 155 amino acids of TPO 
fused to a leader consisting of 7 amino acids of the STII signal sequence followed by a 
factor Xa cleavage site. The plasmid was constructed as shown in Fig. 40 by ligating 
together three pieces of DNA. the first of which was the previously described vector 
PVEG31 in which the small Xbal-Sphl fragment had been removed. The second was the 
following synthetic DNA duplex: 

5'.CTAGAATTATGAAAAAGAATATCGCATTTATCGAAGGTCGTAGCC (SEQ ID NO: 81) 
TTAATACTTTTTCTTATAGCGTAAATAGCTTCCAGCAT-5' (SEQ ID NO" 82) 
The last piece of the ligation was the 1064 base pair Bgll-Sphl fragment from the 
previously described plasmid pMP11. 
(h) Plasmid pMP57 

The plasmid pMP57 expresses the first 155 amino acids of TPO downstream of 
a leader consisting of 9 amino acids of the STII signal sequence and the dibasic site Lys- 
Arg. This dibasic site provides for a means of removing the leader with the protease 
ArgC. This plasmid was constructed as shown In Fig. 41 by ligating together three 
DNA pieces. The first of these was the previously described vector pVEG31 in which 
the small Xbal-Sphl fragment had been removed. The second was the following 
3 5 synthetic DNA duplex: 

5'-CTAGAATTATGAAAAAGAATATCGCATTTCTTCTTAAACGTAGCC (SEQ ID NO: 83) 
TTMTACTTTTTCTTATAGCGTAAAGAAGAATTTGCAT-5' (SEQ ID NO: 84) 
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The last part of the ,i gati on was the 10 64 base pair Bgll-Sph. fragment from the 
previously described plasmid pMPH. 
(i) Plasmid pMP251 

5 amino ^ "T" ^ * * °' pMP21£H in ^ two additional 

5 m,n ac lds of TPO are inc.uded on the carboxy term.na, end. As shown in F, g .42 
h s Pla s mid was constructed by ,i gating together two pjeces of QNA ft- ^ Qf ' 
b*ng , he previously described p MP2 l in which the sma|| Xba , Apa| {fagment ^ 

n re P :; v 2 " .r second part of ,he ii9ation was a 316 base pair ^ 

10 2. Transformation and Induction of E. coli with TPO expression vectors 

44C6 (W3110 tonA, rpoH ts ,on A CpP A galE) using the CaCI 2 heat shock method 
(Mandel, M. eta,., J. Mo , S/o,., 53:159-162. (1970]). ^ transforme(j ^ 
grown f,rs« at arc in LB media containing 50 ,g,m< carbenicillin um(| ^ ^ 

1 5 density ( 6 00nm) of the culture reached approximate, 2 -, me LB cu.ture wa me 

d„u ed 2 0x ,n,o M9 media containing 0.49% casamino acids (w/v, and 50 . 9 /m. 
carben c,,!,. After growth with aeration a, 30»C for 1 hour, 
was added to a fina. concentration of 50 ng/ml. The cu.«ure was then al.owed to 
connnue growing at 30X with aeration for another 15 hours at which time ,he ce.,s 

2 0 were harvested by centrifugation. 

EXAMPLE 22 

Production of Biologically Active TPO (Met* 1-153) In E. coll 
The procedures given below for production of biologically active, refolded TPO 
(met- 1-153) can be applied in anaiogy for the recovery of other TPO variants 
mcludmg N and C terminal extended forms (see Example 23). 
A Recovery of non-soluble TPO (Met 1 1-153) 

E. coli ce..s expressing TPO (Met'1 1-153) encoded by the plasmid pMP210- 
M M as described above. Typically, about 100g of cel te are suspended in 
1 L (10 volumes) of cel. disruption buffer (10 mM Tris. 5 mM EDTA, P H 8) with a 
PCytron homogenizer and the ce.ls centrifuged a. 5000 x g for 30 minutes The 
washed cei, pellet is again .suspended in 1 L cel. disruption buffer with the Poiytron 
homogenizer and the ce„ suspension is passed through a, LH Cel, Disrupter^ 

nceltech Inc.) or through a Microfluidizer (Microfiuidlcs International) according to 
he man^ . |nstruct , ons Thfl ^ ^ ^ * 

a d ^suspended and centrifuged a second time to make a washed refractlle body peliet 
The washed pellet is used immediately or stored frozen at -70'C. 
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B. Solubilization and purification of monomeric TPO (Mer> 1-153) 

PH 8 2 68 T 6 " reSUSP6nded i0 5 V °' UmeS by W8i9hf °' 20 ™ ™s, 

PH 8. w„h 6-8 M guanine and 25 mM DTT (di.hiothreitol) and stirred for 1-3 hr 

urea (6-8M) are also useful bu, genera.ly resu.t in lower yields compared to 

a soiubi,i2ation ' ,he so,ution is -~ - »«» - . - *> 1 
.uZT, superna,ant con,ainin9 dena,ured ' Tp ° p- tei - 

PhTar? fi ,hen Chr ° mat09raphed °" * 200 9 e, filtration co.umn 

(Pharmaca 2.6 x 60 cm) at a flow rate of 2 ml/min. and the protein eluted with 20 

Z2ZT*' PH 60, with 10 mM 0TT Fractions c - ai ^ 

o r; r0te ' n e ' U,in9 b6,W6en Bnd 200 m ' " ^ TPO protein is 

further purged on a semi-preparative C4 reversed phase column (2 x 20 cm 

VYDAc . The sampie is applied 5 m,/min. to a co.umn equilibrated in 0.1% 
TFA(tr,f,uoroacet,c acid) with 30% acetonitrile. The protein Is e.uted with a linear 
graven, of acetonitrile (30-60% In 60 min.). The purified reduced protein e.utes a, 
approx.mate.y 50% acetonitrile. This materia, is used for refolding to obtain 
biologically active TPO variant. 

C Generation of biologically active TPO (MeH 1-153) 

Approximately 20 mg of monomeric, reduced and denatured TPO protein in 40 

iTT- acetonj,rile is diluled int0 360 m ' 01 refo,ding ™ ~ 

optimally the following reagents: 
50 mM Trls 
0.3 M NaCI 
5 mM EDTA 
2% CHAPS detergent 
25% glycerol 
5 mM oxidized glutathione 
1 mM reduced glutathione 
pH adjusted to 8.3 

After mixing, the refolding buffer is gently stirred at 4'C for 12-48 hr to 
effect maximal refolding yields of the correct disulfide-bonded form of TPO (see 
be,ow). The solution is then ac.dlf.ed with TFA ,o a fina, concen.rat.on of 0. 2% 
f.l ered trough a 0.45 or 0.22 m.cron filter, and 1/10 volum. of acetonitrile added' 
Th,a , so.ut.on is then pumped direct.y onto a C4 reversed phase co.umn and the 
purified refolded TPO ( M ef1 M53) e.uted with the same gradient program as 
above. Refolded bio.ogica..y active TPO is eluted a, approxUnateiy 45% ace,onltr,.e 
under these cond,t,ons. Improper disulfide-bonded versions of TPO are eluted earHer 



130- 



WO 95/18858 



PCIYUS94/14553 



The final purified TPO (MeH ie . ^ 

ge, and ana.ytica. C4 eve se T" " "* PW " " B ~- - by S ° S 

purified ma.el T chromatography. For animal studies, the C4 

rt (1 0 I N a,y26d PhySi0,09iCa " y «"-«■"• ^ '-tonic 
5 sap PH 5 ' 5 - 10 mM Na ~ PH S.S or to mM Na 

Phosphate, pH 7.4) conta.n.ng 150 mM NaCI and 0.01% Tween 80 were utllfced 

stimu W POtenCV ° f TP ° in B8/F3 3SSay 

mulat.on ,s ach.eved a, approximately 3 Pg /ml), it « possible to obtain biologically 
t,e mater,, utilizing many differen , ^ detergent ^ ^ 

1 o ( < 7) e - :t r: conditions oniy a sman amoun « - ^ «« 

« 0/.) .s obta in ed. For commercia, manufacturing processes, it is desirable to have 
re^ ing y,e.ds at leas, 10 %. more preferabiy 30-50% and most preferably >50%. 
Many d,f erent detergents (Triton X-,00, dodecyi-beta-mattoside. CHAPS. CHAPSO 
SDS, sarkosy,. Tween 20 and Tween 80, Zwittergent 3-14 and others, were assessed 

1 5 ^7cZX T° rt hl ' 9h re '° ,din9 y,e,dS - °' ^ *»™ «» CHAPS 
5 fam iy (CHAPS and CHAPSO) were found to be generaily usefu, in the refolding 

react.n to limit protein aggregation and improper disulfide formation. Levels o 

CHAPS greater than 1% were most usefu,. Sodium ch.oride was required for bes, 

y.eds. with the optimal leve.s between 0.1 M and 0.5M. The presence of EDTA (1-5 

20 ot am ° Unt " «*■*" aggregation) which was 

20 observed w.th some preparations. Glycerol concentrations of greater than 15% 

h P r b C oth ,he H ref0Win9 COndUi0nS - maX ' mUm y,eldS ' » *" -entia. to 
have b th o X ,d*ed and reduced g.utathione or oxidized and reduced cysteine as the redox 
reagen p ai , h(gher ^ ^ ^ ^ ^ 

re gent s equa, to or ,n excess over the reduced reagen, member of the redox pair P H 
25 values between 7.5 and about . were optima, for refolding of these TPO varU 

J™T: {e ' 9 ' TT ac8,oni,ril9 • methano,) were to,era,ed * «■"•-« 

of 10-15% or lower. H,gher ,eve,s of organic solvents Increased the amount of 
proper* fo,ded forms. Tris and phosphate buffers were genera.ly usefu 
incubation at 4-C afso produced higher levete of properly folded TPO 

0 used InT'T ° f ^ (baSed 00 ^ ^ °' - d — * TPO 
used in the refolding reaction) are typica, for preparations of TPO that have been 

Purlf-ed trough the flrst C4 step. Active materia, can be obtained when Z ^ 
preparations (e.g. directly after the Superdex 200 cCumn or after the initial 

5 r,T rf V 6XtraCti0n) al,h ° U9h ^ yiSldS ^ - ^ * **™ vnjZ 
5 and interference of non-TPO proteins during the TPO refolding process 

Since TPO (Men 1-153, contains 4 cysteine residues, it is possib,e to 
generate three different disulfide versions of this protein: 

version 1: disulfides between cysteine residues 1-4 and 2-3 
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version 2: disulfides between cysteine residues 1-2 and 3-4 
version 3: disulfides between cysteine residues 1-3 and 2-4 

different ^ """" eXP '° rati ° n de,emiinin9 < eU >^ -vera, 
deferent peaks containing the TP0 prote in were separated by C4 reversed phase 
chromatography. Onlv one of th»*« , . oversea pnase 

d«r«mm.H PBaks had si 9 ni,icar * biological activity as 

ol" d USm ' Ba/F3 aSSay " a*"*"""* «» '**■"» Editions were 

el a Pre,eremia,,y * hat VerSi ° n - ^ *~ COnd "'^ - "«-" 

vers.ons are less than 10-20% of the total monomer TPO obtained. 

The disulfide pattern for the biologically active TPO has been determined to be 
-4 and 2-3 by mass spectrometry and protein sequencing^ e. version 1). Aliquots 
of the various 04-resolved peaks (5-10 nmo.es) were digested with trypsin (1- 2S 
mole ratio of trypsin to protein). The digestion mixture was analyzed by matrix- 
assisted laser desorption mass spectrometry before and after reduction with DTT After 
reduct.cn. masses corresponding to most of the larger tryptic peptides of TPO were 
detected. ,„ the un-reduced samp.es, some of these masses were missing and new 
masses were observed. The mass of the new peaks corresponded basicaHy to the sum of 
the rndividua. tryptic peptides involved in the disulfide pair. Thus it was possible to 
unequivocal assign the disulfide pattern of the re.Cded. recombinant. bio.ogicaMy 
active TPO to be 1-4 and 2-3. This is consistent with the known disulfide pattern of 
the related molecule erythropoietin. 

D. Biological activity of recombinant, refolded TPO (met 1-153) 

Refo.ded and purified TPO (Me.-1 1- 15 3) has activity in both in vitro and /„ 

rrr™ ,n r Ba/F3 assay> ha "- maximai * 

■n^ the Ba,F3 cel.s was achieved at 3.3 pg /ml {0 . 3 pM , ln lne mp , recept0 , base(J 
EL.SA. ha,f-ma XI ma. activity occurred a. 1.9 ng/m. ( 120 P M). ,„ norma, and 
myelosuppressed animals produced by near-ietha. X-radiation. TPO (Mefi 1-153) 
was highly potent (activity was seen at doses as low as 30 ng/mouse, to stimu.ate the 
production of new platelets. 



EXAMPLE 23 

Production of Other Biologically Active TPO Variants In E. coll 
Three different TPO variants produced in E. coli. purified and refolded into 
biological active forms are provided below. 

« Ufifi „ 1 V ' ^ r9S,dUeS ^ bac,eria, -^«ved signal sequence ST.. are 
fused to the N-termina. domain of TPO (residues 1-155). The resulting sequence i, 
M KKNIAR-LNAYA SPAPPAC CVRRA (SEQIDNO-85) 
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where the leader sequence is underlined and C C represents Cys? through Cys^ 

Th,s vanant was constructed to provide a tyrosine for radio-iodinatlon of TPO for 

receptor and biological studies. 

( 2 ) H8MLF - 7 residues from the STII sequence. 8 histidina residues and 
Factor Xa enzymatic cieavage sequence IEGR are fused to the N-terminal domain 
(res.dues 1-155) of TPO. The sequence is 

MKKNIAFHHHMHHHHIF^RSPAPPAC CVRRA (SEQ ID NO: 86) 

where the leader sequence is underlined and C C represents Cys? through Cy S 151 

Th,s vanant. when purified and refolded, can be treated with the enzyme Factor Xa 
wh,ch will cleave after the arginine residue of the sequence IEGR yielding a TPO 
vanant of 155 residues in length with a natural serine N-terminal amino acid. 

( 3 ) T-HBMLF - is prepared as described above for variant (2), except a 
thrombin sensitive sequence IEPR is fused to the N-terminal domain of TPO. The 
resulting sequence is 

15 MKKNIAFHHHHHHHHIFPRSPAPPAC CVRRA (SEQ ID NO - 87) 

where the leader sequence is underlined and and C C represents Cy S 7 through 

Cys 51. This variant> aft6r purifjcatjon and refo|d . ng ^ ^ treated ^ e ^ 

thrombin to generate a natural N-terminal variant of TPO of 155 residues in length 
A Recovery, solubilization and purification of monomeric, biologically active TPO 
2 0 variants (1), (2),and (3). 

All of the variants were expressed in E. coll. The majority of the variants were 
found in retractile bodies, as observed in Example 22 for TPO (Mef 1-153) 
Identical procedures for the recovery, solubilization and purification of monomeric 
TPO variants was achieved as described in Examp.e 22. .dentica. refolding 
conditions to those used for TPO (Mefi 1-153) were used wtth overall yields of 30- 
50%. After refolding, the TPO variants were purified by C4 reversed phase 
chromatography in 0.1% TFA utilizing an acetonitri.e gradient as described 
previously. All of the TPO variants (in their unprotected forms) had biological 
activity as assessed by the Ba/F3 assay, with half-maximal activities of 2-5 pM 
3 0 B. Proteolytic processing of Variants (2) and (3) to generate authentic N- 
terminal TPO (U155). 

JPO variants (2, and (3) above were designed with an enzymaticaiiy- 
cleavable leader peptide before the norma. N-termina) amino acid residue of TPO 
After refolding and purification of variants (2) and (3, as described above, each was 
35 subjected to digestion with the appropriate enzyme. For each variant, the acetonftrile 
from the C4 reversed phase step was removed by biowing a gentle stream of nitrogen 
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on the solution. Thereafter the two variants were treated with either Factor Xa or 
thrombin as described below. 

For TPO variant (2), 1 M Tris buffer. pH 8, was added to the acetonitrile-free 
solution to a final concentration of 50 mM and the pH was adjusted to 8 if necessary. 
5 NaCI and CaCI 2 were added to 0.1 M and 2 mM, respectively. Factor Xa (New England 
Biolabs) was added to achieve about a 1:25 to 1:100 mole ratio of enzyme to variant. 
The sample was incubated at room temperature for 1-2 hr. to achieve maximal 
cleavage as assessed by a change In migration on SDS gels representing the loss of the 
leader sequence. Thereafter, the reaction mixture was purified by C4 reversed phase 
1 0 chromatography using the same gradient and conditions as described above for the 
purification of properly folded variants. Uncleaved variant B was separated from 
cleaved variant (2) by these conditions. The N-terminal amino acids were shown to be 
SPAPP, indicating that removal of the N-terminal leader sequence was successful. 
Factor Xa also generated variable amounts of an internal cleavage within the TPO 
domain; cleavage was observed after the arginine residue at position number 118 
generating an additional N-terminal sequence of TTAHKDP(SEQ ID NO: 88). On non- 
reducing SDS gels, a single band at approximately 17000 daltons was observed for the 
Factor Xa cleaved variant; on reducing gels two bands were seen of molecular weight of 
approximately 12000 and 5000 daltons, consistent with cleavage at arginine 118. 

2 0 This observation also confirmed that the two parts of the molecule were held together 

by a disulfide bond between the 1st and 4th cysteine residues, as deduced from the 
tryptic digestion experiments described above. In the Ba/F3 biological assay, the 
purified TPO (1-155) variant, after removal of the N-terminal leader sequence and 
with the Internal cleavage, had a half-maximal activity of 0.2 to 0.3 picomolar. The 
intact variant with the leader sequence had a half-maximal activity of 2-4 picomolar. 

For variant (3), the digestion buffer consisted of 50 mM Tris. pH 8. 2% 
CHAPS, 0.3 M NaCI, 5 mM EDTA and human or bovine thrombin (Calbiochem) at a 
1:25 to 1:50 by weight of enzyme to TPO variant protein. Digestion was conducted at 
room temperature for 2-6 hours. The progress of the digestion was assessed by SDS 

3 0 gels as described above for the Factor Xa cleavage reaction. Generally, more than 90% 

cleavage of the leader sequence was achieved in this time. The resultant TPO was 
purified on C4 reversed phase columns as described above and was shown to have the 
desired N-terminal by amino acid sequencing. Only very minor (<5%) amounts of an 
Internal cleavage at the same arginine-threonine bond as observed above with Factor 
35 Xa was obtained. The resultant TPO protein had high biological activity with half- 
maximal responses In the Ba/F3 assay at 0.2-0.4 picomolar protein. In the mpl 
receptor based ELISA, this protein had a half-maximal response at 2-4 ng/ml purified 
protein (120-240 picomolar) while the intact variant containing the leader sequence 

- 1 34- 



25 



WO 95/18858 



PCT/US94/14553 



1 0 



1 5 



20 



25 



30 



was .ess potent in both assays by 5-10 fold. For anima, studies, the HPLC-purified 
cleaved protein was dialyzed into physiologica. acceptab.e butters, with ,50 mM Nad, 
.1 /. Tween 80 and 10 mM sodium succinate, P H 5.5. or 10 mM sodium acetate. pH 
5.5. or mM sodium phosphate. pH 7.4. By HPLC and SDS gels, the purified protein 
was stabie tor severai weeks when stored at 4'C. In norma, and myelosuppressed mice 
th,s purified TPO with the authentic N-terminal sequence was highly active 
st.mu.atmg the production of platelets at doses as low as 30 ng/mouse. 

EXAMPLE 24 
Synthetic mpl Llgand 
Although Human mpl ligand (hML) is usually made using recombinant methods 
•t can also be synthesized via enzymatic ligation of synthetic peptide fragments using 
methods described below. Synthetic production of hML allows the incorporation of 
unnatural amino acids or synthetic functionalities such as polyethylene glycol 
Previously, a mutant of the serine protease subtilisin BPN, subtiligase 
(S221C/P225A) was engineered to efficiently ligate peptide esters in aqueous solution 
(Abrahmsen et a/., Biochem.. 30:4151-4159 [1991]). „ has now been shown that 
synthetic peptides can be enzymatically ligated in a sequential manor to produce 
enzymatically active long peptides and proteins such as ribonuclease A (Jackson eta, 
Scenes, [1994]). This technology, described in more detail below, has enabled us to 
chemically synthesize long proteins that previously could be made only with 
recombinant DNA technology. 

A general strategy for hML 153 synthesis using subtiligase is shown (Scheme 
D. Beginning with a fully deprotected peptide corresponding to the Cterminai 
fragment of the protein, an N-term.nal protected. Cterminai activated ester peptide 
is added along with subtiligase. When the reaction is complete, the product is isolated 
by reverse phase HPLC and the protecting group is removed from the N-terminus The 
next peptide fragment is ligated. deprotected and the process is repeated using 
successive peptides until full length protein is obtained. The process is similar to 
soHd phase methodology in that an N-terminal protected C-terminal activated petide is 
(.gated to the N-terminus of the preceding peptide and protein is synthesized in a C->N 
direction. However because each coupling results in addition of up to 50 residues and 
the products are isolated after each ligation, much longer highly pure proteins can be 
synthesized in reasonable yields. 
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Scheme 1. Strategy for Synthesis of hML Using Subtiligase 

R-NH-Peptide 2 -co-R' + H 2 N-Peptide r C0 2 

| 1 ) Subtiligase 

R-NH-Peptide 2 -CO-NH-Peptide r co 2 

| 2) Zn/CH 3 C0 2 H 

H 2 N-Peptide2-CO-NH-Peptide r co 2 



I 3) 



'2 

repeat 1 + 2 

H 2 N-Peptide 3 -co-NH-Peptide 2 -co-NH-Peptide r co ; 

o 



^ O (CH2) 4 



NH 2 



NH 



2 



Based on our knowledge of the sequence specificity of the subtiligase as well as 
the amino acid sequence of the biologically active "epo-domain- of hML, we divided 
hML 1S3 into seven fragments 18-25 residues in length. Test ligation tetrapeptides 
were synthesized to determine suitable ligation junctions for the 18-25mWs. Table 
13 shows the results of these test ligations. 

TABLE 13 

hML Test Ligations. Donor and nucleophile peptides were dissolved at 10 mM in 
100 mM tridna (pH 7.8) at 22'C. Llgase was added to a final concentration of 10 uM 
from a 1.6 mg/mL stock (-70 iiM) and the ligation allowed to proceed overnight. 
Yields are based on % ligation vs. hydrolysis of the donor peptides. 
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I Site 



1 (23/24) 



(22/23) 



2 (46/47) 



3 (69/70) 

4 (89/90) 



Ponor fqlc-K-K/H,»i 



HVLH 

(SEQ ID NO: fto^ 



SHVL 

ISEQ ID NO: Q|i 




SRLS 
(SEQ ID NO: 90) 



AVDF 

(SEQ ID NO Q^ji 



(88/89) 



(90/91) 



AVTL 
(SEQ ID NO: flsi 
LSSL 

(SEQ ID NO: 97) 



HSRL 

(SEQ ID NO: 92) 



SLGE 

(SEQ ID NO: 94) 



48 



52 



22 



78 



C(acm)LSS 
(SEQ ID NO: ! 



(88/89) 



5(107/108) 



SSLL 

(SEQ ID NO: 101) 



CLSS 

(SEQ ID NO: 103) 



(106/107) 



6(128/129) 



LQSL 
(SEQ ID NO: ■ 



ALQS 

.(SEQ ID NOMOfi) 



NAIF 
[SEQIDNO: 108) 



LLEG 
(SEQ ID NO: 96) 
LGQL 

ISEQ ID NO: 98) 



LLGQ 
[SEQIDNO: 1QQ) 
GQLS 

(SEQ ID NO: 109) 



LLGQ 

(SEQ ID NO: 100) 



LGTQ 
SEQ ID NO: 10fi) 
LLGT 

(SEQ ID NO: 107) 



LSFQ 
SEQ ID NO: low 



53 



95 



47 



05 



00 
45 



00 



55 



90 



1 0 



99 
70 



01 



30 



60 



40 



Based on these experiments, the ligation peptides indicated in Table 14 should 

term.nus o each donor ester peptide was needed to prevent se«, ig al. w. c£ I 
(iNOC) protecting group (Veber ef a,., J. Org. cL 42 3286 328^ 

rrr: u J - ^ ~ 

termi us for subsequent iigations. A g.ycolate.»ys yl .amide (g,cK.NH 2 , ester waa 

Z TZ y tL iy sub,i " 3ase -*.*—.••-«..£ 

[1*1]). The .NOC-protected, g.^K-amide activated peptides can be synthesized 
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using standard solid phase methods as outlined (Scheme 2). The peptides are then 
seqen.ia.ly ligated until the full protein is produced and the final product refolded /„ 
wfro. Based on homology with EPO, disulfide pairs are believed to be formed between 
cystine residues 7 and 151 and between 28 and 85. Oxidation of the disulfides may be 
accomplished by simply stirring the reduced material under an oxygen atmosphere for 
several hours. The refolded material can then be purified by HPLC and fractions 
contains active protein pooled and lyophilized. As an alternative, disulfides can be 
differentially protected to control sequential oxidation between specific disulfide pairs. 
Protection of cysteines 7 and 151 with acetamidomethyl (acm) groups would ensure 
oxidation of 28 and 85. The acm groups could then be removed and residues 7 and 151 
oxidized. Conversely, residues 28 and 85 could be acm protected and oxidized in case 
sequential oxidation is required for correct folding. Optoinally, Cysteins 28 and 85 
may be substituted with another natural or unnatural residue other than Cys to insure 
proper oxidation of cysteins 7 and 151. 

TABLE 14. 

Peptide Fragments Used For Total Synthesis of h-ML Using Subtlllgase 

Fragment 
20 Seouenca 

1 (SEQ ID NO: 110) 



1 5 



25 



35 



iNOC-HN.SPAPPACDLRVLSKLLRDSHVL-glc-K-NH 2 (1-22) 
2 (SEQ ID NO: 111) 



INOC-HN-HSRLSQCPEVHPLPTPVLLPAVDF-glc-K-NH 2 (23-46) 
3 0 3 (SEQ ID NO: 1 12) 

iNOC-HN-SLGEWKTQMEETKAQDILGAm.glc-K.NH2 (47-69) 
4 (SEQ ID NO: 113) 



iNOC-HN-LLEGVMAARGQLGPTCLSSLL.glc-K-NH2 (70-90) 



-138- 



WO 95/18858 



PCT/US94/14553 



5 (SEQ ID NO: 114) 

iNOC-HN-GQLSGQVRLLLGALQS-glc-K.NH 2 (90-106) 
5 6 (SEQ ID NO: 115) 

iNOC-HN-LLGTQLPPQGRTTAHKDPNAIF.glc-K-NH2 (107-128) 
7 (SEQ ID NO: 116) 



1 0 



1 5 



20 



25 



30 



H2N-LSFQHLLRGKVRFLMLVGGSTLCVR-C0 2 (129-153) 

Peptide ligations are carried out at 25»C in 100mM tricine, pH 8 (freshly 
prepared and degassed by vacuum filtration through a 5 nM filter). Typically the C 
terminal fragment is dissolved in buffer (2-5 mM peptide) and a 10x stock so.ution of 
subtiligase (1 mg/ml in 100mM tricine. pH 8) is added .0 bring the final enzyme 
concentration to - 5uM. A 3-5 molar excess of the g.c-K-NH 2 activated donor 
peptide is then added as a solid, dissolved, and the mixture allowed to stand at 25°C 
The ligations are monitored by analytical reverse phase C18 HPLC (CH 3 CN/H 2 0 
gradient with 0.1% TFA). The ligation products are purified by preparative HPLC and 
lyophHized. Isonicotinyl (iNOC) deprotection was performed by stirring HCI activated 
zinc dust with the protected peptide in acetic acid. The zinc dust is removed by 
titration and the acetic add evaporated under vacuum. The resulting peptide can be 
used directiy in the next ligation and the process is repeated. Synthetic h M L 153 can 
be floated by procedures analogous to those described above to synthetic or 
recombinant hML 15 4-332 to produce synthetic or semisynthetic full length hML. 

Synthetic hML has many advantages over recombinant. Unnatural side chains 
can be introduced in order to improve potency or specificity. Polymer functionalities 
such as polyethylene glycol can be incorporated to improve duration of action For 
example, polyethylene glycol can be attached to lysine residues of the individual 
fragments (Tab.e 14) before or after one or more iigation steps have been 
performed. Protease sensitive peptide bonds can be removed or altered to improve 
stability in vivo. In addition, heavy atom derivatives can be synthesized to aid in 
structure determination. 
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Scheme 2. Solid Phase Synthesis of Peptide Fragments for Segment 
q Ligation. 



H 2 N. 
R 



1 




2 



c , d (automated synthesis) 



MBHA 
RESIN 



v O 

H 2 N^ (Peptide ^ NH Y^o^ ra >rA/^ N \ e.f (cleavage) 



V RESIN J 

or or \jy 



4 



5 O R o (CH2) 4 

1 . I I NH 2 | 
| I r 1 

Isonicotinyl (iNOC) glycolate-lysyl-amide (glc-K-NH,) 

a) Lysyl-paramethylbenzhydrylamine (MBHA) resin 1 (0.63 meq./gm., Advanced 
ChemTech) is stirred with bromoacetic acid (5 eq.) and diisopropyl carbodilmide (5 
eq.) for 1 h. at 25'C in dimethylacetamide (DMA) to afford the bromoacetyl derivative 
2. b) The resin is washed extensively with DMA and Individual Boc-protected amino 
acids (3 eq., Bachem) are esterified by stirring with sodium bicarbonate (6 eq.) in 
dimethylformamide (DMF) for 24 h. at 50'C to afford the corresponding glycolate- 
phenylalanyl-amide-resin 3. The amino acetylated resin 3 Is washed with DMF <3x) 
and dichloromethane (CH 2 CI 2 ) (3x) and can be stored at room temperature for several 
months. The resin 3 can then be loaded Into an automated peptide synthesizer (Applied 
Biosystems 430A) and the peptides elongated using standard solid phase procedures 
(5). c) The N-ct-Boc group is removed with a solution of 45% trifluoroacetic acid in 
CH 2 CI 2 . d) Subsequent Boc-protected amino acids (5 eq.) are preactivated using 
benzotrlazol-1 -yl-oxy-trls-(dimethylamino) phosphonium hexaf luorophosphate 
(BOP, 4 eq.) and N-methylmorphollne (NMM, 10 eq.) in DMA and coupled for 1-2 h. 
e) The final N-a-Boc group is removed (TFA/CH 2 CI 2 ) to afford 4 and the isonicotinyl 
(iNOC) protecting group is introduced as described previously (4) via stirring wtth of 
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=C I 'c r Vl Cafb0nale (3 ^ "* ™ <* « at 

hydro s HF (5 °° dePr °' eCti0n 01 *• * «-t m en» with 

sriitr -0 ^ e,hy,me,hyi su,ide) at °° c ,or 1 h - af, ° rds *° iN0C - 

^^^^T 3C,iVated ^ 5 ^ * •«« * ™~ P— 

confirmed h ( 3 ° N/H2 ° 9rad,en «' ™>- The identity 0 f a., substrates is 

confirmed by mass spectrometry. 
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SUPPLEMENTAL ENABLEMENT 
™ S invention « '.aimed * cabled in accordance with the above specification 
) nd read,ly ava „ab.e references and starting mate*,, Neverthe.ess, Applicants ha 
depose with the American Type Culture Coi.eCion. Rockvllie. Md.. USA (ATCC) the 
cell line listed below: 

Escherichia coli, DHIOB-pBSK-hmn/ 1 in atpp 
deposited February 24, 1994. ' ^ aCC9SS '° n ° RL 69575 ' 

Piasmid. pSVIS.ID.LL.MLORF, ATCC accession no; CRL 75958, deposited 

December 2, 1994; and ^ K u S . iea 

11770 t t*i ce,is ' ml 1/50 mcb (,abe,ed #1594)> atcc aCCeSS '°" ™ 

11770, deposited December 6, 1994. 

Pr.cdu,. and r . gulali0 „ s , hereunaer "» «« 

_c. a a „ ttbl . cultu ,. , or 30 yea „ ^ , W 

b. mad. „ y ATC0 unier , nB lerms M Budapest T,.a,, and l |sa „ an 

. -h= P«n« u.s. p.™. Avatew% of „ ^ *^ 

uZr«c „* r? n *• ta "* n * ■» - »» *- »— 

under ft. auttorfty ot any g.v«nm« m aecortanc. «n g, ,*„ 

dn' ,r, ^ - «»*-» «■ — 

MM. . and ,„ ,„. „, (offl , J£ 

r«°i ° *• * p " ,, and scom — 1 *- • - -t 

appended claims and equivalents thereof. 

Al. references cited herein are hereby expressly incorporated by reference. 
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SEQUENCE LISTING 
(1) GENERAL INFORMATION: 

5 (i) APPLICANT: Genentech, Inc. 

Eaton, Dan L. 

de Sauvage, Frederic J. 

1Q (ii) TITLE OF INVENTION : THROMBO PO I ET IN 

(iii) NUMBER OF SEQUENCES; 144 

(iv) CORRESPONDENCE ADDRESS: 
i e (A) ADDRESSEE: Genentech, Inc. 

1! > (BJ STREET : 460 Point San Bruno Blvd 

(C) CITY: South San Francisco 

(D) STATE: California 

(E) COUNTRY: USA 
2Q (F) ZIP: 94080 

(v) COMPUTER READABLE FORM- 

(A) MEDIUM TYPE: 5.25 inch, 360 Kb floppy disk 

(B) COMPUTER : IBM PC compatible E1 ° PPy dlsk 
n (C) DERATING SYSTEM: PC-DOS /MS-DOS 

" < D > SOFTWARE: patin (Genentech) 

(vi) CURRENT APPLICATION DATA: 
(A) APPLICATION NUMBER: 
<B) FILING DATE: 
30 (C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 08/176553 
35 <B) FILING DATE: 03-JAN-1994 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 08/348657 
<B) FILING DATE: 02-DEC-1994 

4 0 (vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 08/185607 

(B) FILING DATE: 21-JAN-1994 

(vii) PRIOR APPLICATION DATA: 
45 <A > APPLICATION NUMBER: 08/348658 

(B) FILING DATE: 02-DEC-1994 

(vii) PRIOR APPLICATION DATA: 
c n (A > APPLICATION NUMBER: 08/196689 

0U (B) FILING DATE: 15-FEB-1994 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 08/223263 
55 <B> FILING DATE: 04-APR-1994 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 08/249376 

(B) FILING DATE: 25-MAY-1994 
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(viii) ATTORNEY/ AGENT INFORMATION ■ 
(A) NAME: Winter, Daryl B 
(B> REGISTRATION NUMBER; 32 637 
(O REFERENCE / DOCKET NUMBER- 871P5PCT 

(ix) TELECOMMUNICATION INFORMATION • 
(A) TELEPHONE: 415/225-1249 
|B> TELEFAX: 415/952-9881 
(C) TELEX: 910/371-7168 

(2) INFORMATION FOR SEQ ID NO : 1 : 



(i) SEQUENCE CHARACTERISTICS* 
(A) LENGTH: 353 amino acids 
O) TYPE: amino acid 
(D) TOPOLOGY: linear 

2q (Xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 

Met Glu Leu Thr Glu Leu Leu Leu Val Val Met Leu Leu Leu Thr 

io 15 

25 Ala Arg Leu Thr Leu Ser Ser P ro A1 a Pro Pro Ala Cys Asp Leu 

25 30 
Arg Val Leu Ser Lys Leu Leu Arg Asp Ser His Val Leu ^ ^ 

^ ^ 45 

30 Arg Leu Ser Gin Cys Pro Glu Val His Pro Leu Pro Thr Pro Val 

55 60 
Leu Leu Pro Ala Val Asp Phe Ser Leu Gly Glu Trp Lys Thr Gin 
35 70 75 

Met Glu Glu Thr Lys Ala Gin Asp „. Leu Gly Ala Val Thr Leu 

85 90 

4Q Leu Leu Glu Gly Val Met Ala Ala Ar g Gly Gin Leu Gly Pro Thr 

100 105 
Cys Leu Ser Ser Leu Leu Gly Gin Leu Ser Gly Gin Val Arg Leu 

115 120 
45 Leu Leu Gly Ala Leu Gin Ser Leu Leu Gly Thr Gin Leu Pro Pro 



130 135 



Gin Gly Arg Thr Thr Ala His Lys Asp P ro ^ Ala a . phe Leu 
50 " "5 15Q 

Ser Phe Gin His Leu Leu Arg Gly Lys Val Arg Phe Leu Met Leu 

160 1S5 

Val Gly Gly Ser Thr Leu Cys Val Am a™ n „ 
55 17Q ^ ys vai Hrg Arg Ala Pro Pro Thr Thr 

175 180 
Al- Val Pro Ser Arg Thr. Ser Leu Val Leu Thr Leu Asn Glu Leu 

190 195 
6 0 Pro Asn Arg Thr Ser Gly Leu Leu Glu Thr Asn Phe Thr Ala Ser 



205 210 
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Ala Arg Thr Thr Gly Ser Glv ' p,, t«„ t 

y er biy ^ eu Leu Lys Trp Gin Gin Gly Phe 

225 

Arg Ala Lys Ile Pro G ly Leu Leu Agn Gln Thr Ser ^ ^ 

235 240 
Asp Gin lie Pro Gly Tyr Leu Asn Arg lie Hie Glu Leu Leu Asn 
245 250 2S5 

Gly Thr Arg Gly Leu Phe Pro Gly Pro Ser Arg Arg Thr Leu Gly 
260 265 270 

Ala Pro Asp U e Ser Ser Gly Thr Ser Asp Thr Gly Ser Leu Pro 
275 280 285 

Pro Asn Leu Gin Pro Gly Tyr Ser Pro Ser Pro Thr His Pro Pro 
290 295 30Q 

2 0 Thr Gly Gin Tyr Thr Leu Phe Pro Leu Pro Pro Thr Leu Pro Thr 

305 310 315 



15 



30 



Pro val val Gin Leu His Pro Leu Leu Pro Asp Pro ser Ala Pro 

25 320 325 330 

Thr Pro Thr Pro Thr Ser Pro Leu Leu Asn Thr Ser Tyr Thr His 
335 3 4 o 345 



Ser Gin Asn Leu Ser Gin Glu Gly 
350 353 

(2) INFORMATION FOR SEQ ID NO: 2: 



(i) SEQUENCE CHARACTERISTICS: 
35 (A) LENGTH: 1795 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) topology: linear 

4 0 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

TCTTCCTACC CATCTGCTCC CCAGAGGGCT GCCTGCTGTG CACTTGGGTC 50 

45 CTGGAGCCCT TCTCCACCCG GATAGATTCC TCACCCTTGG CCCGCCTTTG 100 

CCCCACCCTA CTCTGCCCAG AAGTGCAAGA GCCTAAGCCG CCTCCATGGC 150 

50 

CCCAGGAAGG ATTCAGGGGA GAGGCCCCAA ACAGGGAGCC ACGCCAGCCA 200 
55 GACACCCCGG CCAGAATGGA GCTGACTGAA TTGCTCCTCG TGGTCATGCT 250 

TCTCCTAACT GCAAGGCTAA CGCTGTCCAG CCCGGCTCCT CCTGCTTGTG 300 
60 ACCTCCGAGT CCTCAGTAAA CTGCTTCGTG ACTCCCATQT CCTTCACAGC 350 
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AGACTGAGCC AGTGCCCAGA GGTTCACCCT TTGCCTACAC CTGTCCTGCT 400 
5 GCCTGC^ GACTTTAGCT TGGGAGAATG GAAAACCCAG ATGGAGGAGA 450 
CCAAGGCACA GGACATTCTG GGAGCAGTGA CCCTTCTGCT GGAGGGAGTG 500 

1 0 

ATGGCAGCAC GGGGACAACT GGGACCCACT TGCCTCTCAT CCCTCCTGGG 550 
t g GCAGCTTTCT GGACAGGTCC GTCTCCTCCT TGGGGCCCTG CAGAGCCTCC 600 
TTGGAACCCA GCTTCCTCCA CAGGGCAGGA CCACAGCTCA CAAGGATCCC 650 
20 AATGCCATCT TCCTGAGCTT CCAACACCTG CTCCGAGGAA AGGTGCGTTT 700 
CCTGATGCTT GTAGGAGGGT CCACCCTCTG CGTCAGGCGG GCCCCACCCA 750 

25 

CCACAGCTGT CCCCAGCAGA ACCTCTCTAG TCCTCACACT GAACGAGCTC 800 
a q CCAAACAGGA CTTCTGGATT GTTGGAGACA AACTTCACTG CCTCAGCCAG 850 

AACTACTGGC TCTGGGCTTC TGAAGTGGCA GCAGGGATTC AGAGCCAAGA 900 
3 5 TTCCTGGTCT GCTGAACCAA ACCTCCAGGT CCCTGGACCA AATCCCCGGA 950 

TACCTGAACA GGATACACGA ACTCTTGAAT GGAACTCGTG GACTCr^CC 1000 

40 

TGGACCCTCA CGCAGGACCC TAGGAGCCCC GGACATTTCC TCAGGAACAT 1050 

4 5 CAGACACAGG CCCAACCTCC AGCCTGGATA TTCTCCTTCC 1100 

CCAACCCATC CTCCTACTGG ACAGTATACG CTCTTCCCTC TTCOCCOU: 1150 

5 0 CTTGCCCACC CCTOTGG^C AGCTCCACCC CCTGCTTCCT GACCCTTCTG 1200 

CTCCAACGCC CACCCCTACC AGCCCTCTTC TAAACACATC CTACACCCAC 1250 

55 

TCCCAGAATC TGTCTCAGGA AGGGTAAGGT TCTCAGACAC TGCCGACATC 1300 
6Q AGCATTGTCT CATGTACAGC TCCCTTCCCT GCAGGGCGCC CCTGGGAGAC 1350 
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AACTGGACAA GATTTCCTAC TTTC^CCTra aarw,. 

L * CCTGA AACCCAAAGC CCTGGTAAAA 1400 

s OOCATACACA GGACTGAAAA GGGAATCAI. ^ 
CTTCAGAACC TATTTTTTTA AGCTATCAGC AATACTCATC ACACCACCTA 15 00 
1 0 GCTCTTTGGT CTATTTTCTG CAGAAATTTG CAACTCACTG ATTCTCTACA 1550 
TGCTCTTTTT CTGTGATAAC TCTGCAAAGG CCTGGGCTGG CCTGGCAGTT 1600 
GAACAGAGGG AGAGACTAAC CTTGAGTCAG AAAACAGAGA AAGGGTAATT 1650 
2q TCCTTTGCTT CAAATTCAAG GCCTTCCAAC GCCCCCATCC CCTTTACTAT 1700 
CATTCTCAGT GGGACTCTGA TCCCATATTC TTAACAGATC TTTACTCTTG 1750 
2 5 AGAAATGAAT AAGCTTTCTC TCAGAAAAAA AAAAAAAAAA AAAAA " 1795 



1 5 



30 



35 



(2) INFORMATION FOR SEQ ID NO: 3: 

U) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 42 amino acids 
(BJ TYPE: amino acid 
(D) TOPOLOGY: linear 



30 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 

Leu Leu Leu Val Val Met Leu Leu Leu Thr Ala Arg Leu Leu 
40 10 15 

S.r Ser Pro Ala Pro Pro Ala Cys Asp Leu Arg Val Leu Ser Lys 

25 3 
^ Leu Leu Arg Asp Ser His Val Leu His Ser Arg Leu 

35 40 42 

(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS • 
50 <A) LENGTH: 390 bases 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 
GAATTCCTGG AATACCAGCT GACAATGATT TCCTCCTCAT CTTTCAACCT 50 
CACCTCTCCT CATCTAAGAA TTGCTCCTCG TGGTCATGCT TCTCCTAACT 100 
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55 



60 
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GCAAGGCTAA CGCTG^CCAP rrrr-n. 

TO-CCAG CCCGGCTCCT CCTGCTTGTG ACCTCCGAGT 150 

CCTCAGTAAA GTGCTTCGTG ACTCCCATGT CCTTCACAGC AGACTGGTGA 200 

1 o GAACTCCCAA CATTATCCCC TTTATCCGCG TAACTGGTAA GACACCCATA 250 

CTCCCAGGAA G AC AC CATC A C^CCTCTAA CTCCTTGACC CAATGACTAT 300 

1 ^ TOCCATA ^CCCAC CTAC^A CACTCTC^A CAAGAATTAT 350 

TCTTCACAAT ACAGCCCGCA TTTAAAAGCT CTCGTCTAGA 390 

20 

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS- 
" (A) LENGTH: 390 bases 

fB) TYPE: nucleic acid 
<C) STRANDEDNESS : single 
(D) TOPOLOGY: linear 

30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:5: 

CTTAAGCACC TTATGGTCGA CTGTTACTAA AGGAGGAGTA GAAAGTTGGA 50 

35 

""^ GTAGATTCTT AACGAGGAGC ACCAGTACGA AGAGGATTGA 100 

4o correct gcgacaggtc gggccgagga ggacgaacac ^aggctca 150 
gcactca™ GACGAAGCAC TGAGGGTACA ggaagtctcg TCTGAC cact 200 

45 CTTGAGGGTT GTAATAGGGG AAATAGGCCC ATTGACCATT CTCTCOVM 250 
GAGGGTCCTT ctgtggtagt caaggaga^ GAGGAAC^ GTTAC^TA 300 

50 

AGAAGGGTAT AACAGGGGTG GATGACTAGT GTGAGAGACT GTTCTTAATA 350 
55 AGAAGTGTTA TC ^GGGCGT AAATTTTCGA GAGCAGATCT 390 

(2) INFORMATION FOR SEQ ID NO: 6: 

60 (i) SEQUENCE CHARACTERISTICS * 

(A) LENGTH: 332 amino acids 
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1 0 



35 



50 



(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

Ser Pro Ala Pro Pro Ala Cys Asp Leu Arg Val Leu Ser Lys Leu 

1 5 10 15 

Leu Arg Asp Ser His Val Leu His Ser Arg Leu Ser Gin Cys Pro 

20 25 30 

Glu Val His Pro Leu Pro Thr Pro Val Leu Leu Pro Ala Val Asp 



1 5 Phe Ser Leu Gly Glu Trp Lys Thr Gin Met Glu Glu Thr Lys Ala 

50 55 60 

Gin Asp He Leu Gly Ala Val Thr Leu Leu Leu Glu Gly Val Met 
20 65 70 75 

Ala Ala Arg Gly Gin Leu Gly Pro Thr Cys Leu Ser Ser Leu Leu 
80 85 90 

Gly Gin Leu Ser Gly Gin Val Arg Leu Leu Leu Gly Ala Leu Gin 
25 95 100 105 

Ser Leu Leu Gly Thr Gin Leu Pro Pro Gin Gly Arg Thr Thr Ala 
HO us 120 

3 0 His Lys Asp Pro Asn Ala He Phe Leu Ser Phe Gin His Leu Leu 

125 130 i3 5 



Arg Gly Lys Val Arg Phe Leu Met Leu Val Gly Gly Ser Thr Leu 
I 40 145 150 

Cys Val Arg Arg Ala Pro Pro Thr Thr Ala Val Pro Ser Arg Thr 
155 160 165 

Ser Leu Val Leu Thr Leu Asn Glu Leu Pro Asn Arg Thr Ser Gly 
40 170 175 180 

Leu Leu Glu Thr Asn Phe Thr Ala Ser Ala Arg Thr Thr Gly Ser 
185 190 195 

4 5 Gly Leu Leu Lys Trp Gin Gin Gly Phe Arg Ala Lys He Pro Gly 

200 205 210 

Leu Leu Asn Gin Thr Ser Arg Ser Leu Asp Gin He Pro Gly Tyr 
215 220 225 

Leu Asn Arg He His Glu Leu Leu Asn Gly Thr Arg Gly Leu Phe 
230 235 240 



Pro Gly Pro Ser Arg Arg Thr Leu Gly Ala Pro Asp He Ser Ser 
55 245 250 255 

Gly Thr Ser Asp Thr Gly Ser Leu Pro Pro Asn Leu Gin Pro Gly 
260 265 270 

60 * Tyr Ser Pro Ser Pro Thr His Pro Pro Thr Gly Gin Tyr Thr Leu 

275 280 285 
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Phe Pro Leu Pro P-o Thr r„ « 

Pjo Thr Leu Pro. Thr Pro Val Val Gin Leu His 

295 300 

5 Pro Leu Leu Pro Asn c 

Asp Pro ser Ala Pro Thr Pro Thr Pro Thr Ser 

Pro Leu Leu Asn Thr Ser ^ Thr His Ser Qla Asn ^ ^ ^ 
10 S2Q 325 330 

Glu Gly 
332 



1 5 



20 



(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS * 

(A) LENGTH: 166 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:7 : 



Ala Pro Pro Arg Leu lie Cys Asp Ser Ar g Val Leu Glu Arg Tyr 
25 5 10 g ^ 



Leu Leu Glu Ala Lys Glu Ala Glu Asn lie Thr Thr Gly Cys Ala 

3Q Glu His Cys Ser Leu Asn Glu Asn lie Thr Val Pro Asp Thr Lys 

Val Asn Phe Tyr Ala Trp Lys Arg Met Glu Val Gly Gin Gin Ala 

55 60 
35 va! Glu val ftp Gin Gly Leu Ala Leu Leu Ser Glu Ala Val Leu 

Arg Gly Gin Ala Leu Leu Val Asn Ser Ser Gin Pro Trp Glu 
40 85 go 

Leu Gin Leu His Val Asp Lys Ala Val Ser Gly Le U ^ Ser Leu 

100 10S 
45 Thr Thr Leu Leu Arg Ala Leu Gly Ala Gin Lys Glu Ala Xl e Ser 

Pro Pro Asp A1 . Ala Ser Ala Ala Pro L.u Ar g Thr Xle Thr 2 

130 135 
Asp Thr Phe Arg Lys Leu Phe Arg Val Tyr Ser Asn Phe Leu Arg 



145 150 



«y Lys Leu Lys Leu Tyr Thr Gly Glu Ala Cys Arg Thr Gly Asp 

55 160 165 

Arg 
166 
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(2) INFORMATION FOR SEQ ID N0 : 8 : 

(i) SEQUENCE CHARACTERISTICS- 
(A) LENGTH: 328 amino acids 
5 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

1 0 Ser Pro Ala Pro Pro Ala Cys Asp Leu Arg Val Leu Ser Lys Leu 

5 10 :5 



20 



Leu Arg Asp Ser His Val Leu His Ser Arg Leu Ser Gin Cys Pro 
20 25 3 0 

Glu val His Pro Leu Pro Thr Pro Val Leu Leu Pro Ala Val Asp 
35 «0 45 

Phe Ser Leu Gly Glu Trp Lys Thr Gin Met Glu Glu Thr Lys Ala 

55 60 
Gin Asp lie Leu Gly Ala Val Thr Leu Leu Leu Glu Gly Val Met 
55 ? 0 75 

2 5 Ala Ala Arg Gly Gin Leu Gly Pro Thr Cys Leu Ser Ser Leu Leu 

80 « 90 

Gly Gin Leu Ser Gly Gin Val Arg Leu Leu Leu Gly Ala Leu Gin 
95 10 ° 105 

Ser Leu Leu Gly Thr Gin Gly Arg Thr Thr Ala His Lys Asp Pro 

Asn Ala lie Phe Leu Ser Phe Gin His Leu Leu Arg Gly 
125 130 

Arg Phe Leu Met Leu Val Gly Gly Ser Thr Leu Cys Val Arg Arg 



50 



30 95 WO 

Thr 

US 120 
3g „ _ aer „ ne oln His Leu ^ ^ ^ 

130 135 

Leu Cys Val Arg 
145 150 
4 0 Ala Pro Pro Thr Thr Ala Val Pro Ser Arg Thr Ser Leu Val Leu 

160 165 
Thr Leu Asn Glu Leu Pro Asn Arg Thr Ser Gly Leu Leu Glu Thr 
45 175 180 

Asn Phe Thr Ala Ser Ala Arg Thr Thr Gly Ser Gly Leu Leu Lys 



195 

Trp Gin Gin Gly Phe Arg Ala Lys He Pro Gly Le U Leu Asn Gin 



205 



210 



Thr Ser Arg Ser Leu Asp Gin lie Pro ^ ^ Leu ^ ^ g ^ 



220 



55 His Glu 



225 



Leu Leu Asn Gly Thr Arg Gly Leu Phe Pro Gly Pro Ser 



23 5 240 



Arg Arg Thr Leu Gly Ala Pro Asp He Ser. Ser Gly Thr Ser Asp 

60 250 255 
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Thr Gly Ser Leu pro Pro Asn Leu Gln Pro Qly ^ Ser prQ ^ 

265 270 
g Pro Thr His Pro Pr Thr Qly ^ ^ ^ ^ ^ ^ ^ 

" 5 280 285 

Pro Thr Leu Pro Thr Pro Val v.l Gin Leu His Pro Leu Leu Pro 
290 295 3 0 o 

1 0 Asp Pro ser Ala Pro Thr Pro Thr Pro Thr Ser Pro Leu Leu Asn 
305 310 315 

Thr Ser Tyr Thr His Ser Gin Asn Leu Ser Gin Glu Gly 
1 5 320 325 328 

(2) INFORMATION FOR SEQ ID NO: 9: 



20 



50 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2S5 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(xi> SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

2 5 Ser Pro Ala Pro Pro Ala Cys Asp Leu Arg Val Leu Ser Lys Leu 

5 10 15 

Leu Arg Asp Ser His Val Leu His Ser Arg Leu Ser Gin Cys Pro 
30 25 30 

Glu Val His Pro Leu Pro Thr Pro Val Leu Leu Pro Ala Val Asp 
35 *° 45 

^ Phe Ser Leu Gly Glu Trp Lys Thr Gin Met Glu Glu Thr Lys Ala 



55 



60 



Gin Asp lie Leu Gly Ala Val Thr Leu Leu Leu Glu Gly Val Met 
65 ™ 75 

4 0 Ala Ala Arg Gly Gin Leu Gly Pro Thr Cys Leu Ser Ser Leu Leu 



80 85 



90 



Gly Gin Leu Ser Gly Gin Val Arg Leu Leu Leu Gly Ala Leu Gin 
45 95 100 105 

Ser Leu Leu Gly Thr Gin Leu Pro Pro Gin Gly Arg Thr Thr Ala 
110 "5 120 

His Lys Asp Pro Asn Ala lie Phe Leu Ser Phe Gin His Leu Leu 



130 



135 



Arg Gly Lys Asp Phe Trp He Val Gly Asp Lys Leu His Cys Leu 



145 



150 



55 Ser Gin Asn Tyr Trp Leu Trp Ala Ser Glu Val Ala Ala Gly He 

3 160 165 

Gin Ser Gin Asp Ser Trp Ser Ala Glu Pro Asn Leu Gin Val Pro 
60 170 175 180 
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Gly Pro Asn Pro Arg He Pro n,, m . 

185 1U GlR f!P Thr Thr Leu Glu 

0 195 

, «» - ~ *» ^ s „ Trp fc Thr 01n ^ p fc= 

«. P~ = ly His P». L .„ ^ u> ^ ^ ^ ^ ™ 

220 225 
^ Gln Pro Pro jj. Trp ne phe ^ ^ ^ ^ s ^ ^ 

235 240 
Tyr Trp Tnr val g Ma Leu pro ^ wa 

" 250 255 

Pro Cys Gly Pro Ala Pro Pro Pro Ala Ser 
260 265 
(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS- 

(A) LENGTH: 261 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY : linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
Ser Pro Ala Pro Pro Ala Cys Asp Leu Arg Val Leu Ser Lys Leu 
Leu Arg Asp Ser „i 3 Val Leu His Ser ^ ^ ^ ^ ^ 

25 30 
Glu Val His Pro Leu Pro Thr Pro Val Leu Leu Pro Ala Val Asp 

Phe Ser Leu Gly Glu Trp Lys Thr Gin Met Glu Glu Thr Lys 2 

55 60 
Gin Asp lie Leu Gly Ala Val Thr Leu Leu Leu Glu Gly Val Met 

Ala Ala Arg Gly Gin Leu Gly Pro Thr Cys Leu Ser Ser Leu Le^ 

85 90 
Gly Gin Leu Ser Gly Gin V al Arg Leu Leu Leu Gly Ala Leu Gin 

100 105 
Ser Leu Leu Gly Thr Gin Gly Arg Thr Thr Ala Hi. Lys Asp Pro 

115 120 
As* Ala He Phe Leu Ser Phe Gin His Leu Leu Arg Gly Lys Asp 

130 135 
Phe Trp He Val Gly Asp Lys Leu His Cys Leu Ser Gin Asn ^ 

143 150 
Trp Leu Trp Ala Ser Glu Val Ala Ala Gly „. Gln Ser Gln ^ 



160 165 
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Ser Trp Ser Ala Glu Pro Asn Leu Qln ^ ^ ^ ^ ^ ^ 

175 180 
Ar, II. Pro 01u ji» Asp Thr ^ Thr ^ ^ 

190 195 
Thr Leu ser Trp Thr Leu Thr ^ Agp ^ ^ ^ ^ ^ ^ 



205 210 



1 0 Phe Leu Arg Asn „ ^ ^ ^ ^ ^ ^ ^ ^ 

215 220 225 



Ala Trp He Phe Ser Phe Pro Asn Pro Ser Ser Tyr Trp Thr V.l 
230 235 240 

Tyr Ma Leu Pro s Ser Thr His Leu Ala His Pro Cys G1 y Pro 



20 



245 

Ala Pro Pro Pro Ala Ser 
260 261 

(2) INFORMATION FOR SEQ ID NO: 11: 



250 



255 



(i) SEQUENCE CHARACTERISTICS- 
" (A) LENGTH: 7849 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:U: 

CCCAGCCTCC TTTCTCTTGT TCCCTGGTCA TCCCTGCCTC CCTGTCTCCT SO 

35 

GTCTCrCCCT CCCACACACA CCCACTATCC TCCCAGCTAT CCCTACACCC 100 
4 0 rcCTTCCTAA TCTOGGGAGA CATCTCGTCT GGCTGGACGG GAAAATTCCA 150 
GGATCTAGGC CACACTTCTC AGCAGACATG CCCATCCTTG GGGAGGAGGA 200 
4 5 ACAGGAGAGA GCCTGAGGAA GTTCTGGGGG ACAGGGGGAT GATGGGATCA 250 
AGGTCAGGCC AGGAAGCCCC TGAGGACAGA GACTGTGGGG AGACTGGGAC 300 

50 

WGGAAGAAA GCAAAGGAGC TAGAGCCAGG GCCAAAGGAA AAGGGGGGCC 350 
s s AGCAGGGAGG TATTTGCGGG GGAGGTCCAG CAGCTGTGTT TCCTAAGACA 400 

GGGACACATG GGCCTGGTTA TTCCTCTTGT CACATGTGGA ACGGTAGGAG 450 
6 0 ATGGAAGACG GAGACAGAAC AAGCAAAGGA GGGCCCTGGG CACAGAGGTC 500 
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TGTGTGTGTA GCCATCCAAG CCACTGGACC CCAGCAGACG AGCACCTAAG 550 
5 CTCAGGCTTA ACCCAGTGCA CGTGTGCGCA CATACATGTG CCCCGCACCT 600 
GACAGTCCAC TCAACCCGTC CAAACCCTTT CCCCATAACA CCAACCCATA 650 

1 0 

ACAGGAGATT TCTCTCATGT GGGCAATATC CGTGTTCCCA CTTCGAAAGG 700 
! 5 GGGAATGACA AGATAGGACT CCCTAGGGGA TTACAGAAAG AAAAGCAGGA 750 
AAGCAAGCAT CCTGTTGGAT TTCAGCAGCA GGTATGATGT CCAGGGAAAA 800 
20 GAAATTTGGA TAGCCAGGGA GTGAAAACCC CACCAATCrr AAACAAGACC 850 
TCTGTGCTTC TTCCCCAGCA ACACAAATGT CCTGCCAGAT TCCTCCTGGA 900 

25 

AAAAAACTTC TGCTCCTGTC CCCCTCCAGG TCCCAGGTTG CCCATGTCCA 950 
30 GGAAAAGATG GATCCCCCTA TCCAAATCTT CTCCGTGGTG TGTGTGGGTG 1000 

GAGGAGTGGA CCCTCGTCCA GGCAGGGGCT CCAGGGAAGA GAAGGCGTCA 1050 
35 CTTCCGGGGG CCTTCACCAG TGTCTGGTGG CTCCCTTCTC TGATTGGGCA 1100 

GAAGTGGCCC AGGCAGGCGT ATGACCTGCT GCTGTGGAGG GGCTGTGCCC 1150 

40 

CACCGCCACA TGTCTTCCTA CCCATCTGCT CCCCAGAGGG CTGCCTGCTG 1200 
45 TGCACTTGGG TCCTGGAGCC CTTCTCCACC CGGTGAOTGO CCAGCAGGGT 1250 

GTGGGGTTAT GTGAGGGTAG AAAGGACAGC AAAGAGAAAT GGGCTCCCAG 1300 
5 0 CTGGGGGAGG GGCAGGCAAA CTGGAACCTA CAGGCACTGA CCTTTCTCGA 1350 
GAAGAGTOTA GCCTTCCCAG AATGGGAGGA GCAGGGCAGA GCAGGGGTAG 1400 

55 

GOOGTGGGGT GCTQGTTTCT GAGGGACTGA TCACTTACTT GGTGGAATAC 14S0 
60 AGCACAGCCC TGGCTGGCCC TAAGGAAAGG GGACATGAGC CCAGGGAGAA 1500 
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^ TTAGGGCTTA GCAAACACAG TAGTAAGATG 1550 

g GACACAGCCC CAATCCCCAT TCTTAGCTGG TCATTCCTCG TTAGCTTAAG 1600 
GTTCTGAATC TGGTGCTGGG GAAGCTGGGC CAGGCAAGCC AGGGCGCAAG 1650 
1 0 GAGAGGGTAA TGGGAGGAGG GCCCACTCAT GTTGACAGAC CTACAGGAAA 1700 
TCCCAATATT GAATCAGGTG CAAGCCTCTT TGCACAACTT GTGAAAGGAG 1750 

1 5 

GAGGAAGCCA TGTGGGGGGT CCTGTGAAGG AACCGGAAGG GGrTC^CCA 1800 
2Q AGGGGGCAGG GAGGCAGGTG TGAGCTATGA GACAGATATG TTAGTGGGCG 1 8S0 
CCTAAGACAA GGTAAGCCCC TAAGGTGGGC ATCACCCAGC AGGTGCCCGT 1900 
2 5 TCCTGGGCAG CTGGTCTCAG GAAGGAAGTC CCAGAACTGT TAGCCCATCT 1950 
CTTGGCCTCA GATAATGGAG TATTTCAGGA CTTGGAGTCC AAAGAAAAGC 2000 

30 

TCCAGTGGCT TTATGTGTGG GGGTAGATAG GGAAAGAATA GAGGTTAATT 2050 
g g TCTCCCATAC CGCCTTTTAA TCCTGACCTC TAGTGGTCCC AGTTACAGCT 2100 

TTGTGCAGTT CCCCTCCCCA GCCCCAC TCC CCACCGCAGA AGTTACCCCT 21 50 
4 0 CAACATATTG CGCCCG^TG CCAG^CTC ACCCAGGCCC ^CATCCCAT 2200 

rrrccACTCT crrcrccAGG CTO aagc CAC ^ 

CCATCCCAGA TTTTCTCTGA CCTAACAACC AAGGTTGCTC AGAATTTAAG 2300 
5 0 <KaKnM » ATATGTGTGT ATACATATCA TGTCCTGCTG CTCTCAGCAG 2350 

GGGTAGGTGG CACCAAATCC GTGTCCGATT CACTGAGGAG TCCTGACAAA 2400 
5 5 AAGGAGACAC CATATGCTTT CTTGCTTTCT TTCTTTCTTT CTTTCTTTTT 2450 

TTTTTTTTGA GACGGAGTTT CACTCTTATT GCCCAGGC^ GAGTGCAATG 2500 

60 

GTGCGATCTC GGCTCACCAC AAACCTCCGC CTCCCAGGTA CAAGCGATTC 2550 
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TCCTGTCTCA GCCTCCCMG TAGCTTGGAT TACAGGCATG AGCCACCACA 2600 

5 

CCCTGCTAGT TTTTTTGTAT TTCGTAGAGC CGGGGTTTCA CCATGTTAGT 2650 
1 o GAGGCTGGTG GCGAACTCCT GACCTCAGGT GATCCACCCG CCTTGGACTC 2700 

CCAAAGTGCT GGGATTACAG GCATGAGCCA CTOCACCCGG CACACCATAT 2750 
1 5 GCTTTCATCA CAAGAAAATG TGAGAGAATT CAGGGCTTTG GCAGTTCCAG 2800 
GCTGGTCAGC ATCTCAAGCC CTCCCCAGCA TCTGTTCACC CTGCCAGGCA 2850 

20 

GTCTCTTCCT AGAAACTTGG TTAAATGTTC ACTCTTCTTG CTACTTTCAG 2900 
2g GATAGATTCC TCACCCTTGG CCCGCCTTTG CCCCACCCTA CTCTGCCCAG 2950 

AAGTGCAAGA GCCTAAGCCG CCTCCATGGC CCCAGGAAGG ATTCAGGGGA 3000 
3 0 GAGGCCCCAA ACAGGGAGCC ACGCCAGCCA GACACCCCGG CCAGAATGGA 3050 
GCTGACTGGT GAGAACACAC CTGAGGGGCT AGGGCCATAT GGAAACATGA 3100 

35 

CAGAAGGGGA GAGAGAAAGG AGACACGCTG CAGGGGGCAG GAAGCTGGGG 3150 
40 GAACCCATTC TCCCAAAAAT AAGGGGTCTG AGGGGTGGAT TCCCTGGGTT 3200 

TCAGGTCTGG GTCCTGAATG GGAATTCCTG GAATACCAGC TGACAATGAT 3250 
45 TTCCTCCTCA TCTTTCAACC TCACCTCTCC TCATCTAAGA ATTGCTCCTC 3300 
GTGGTCATGC TTCTCCTAAC TGCAAGGCTA ACGCTGTCCA GCCCGGCTCC 3350 

50 

TCCTGCTTGT GACCTCCGAG TCCTCAGTAA ACTGCTTCGT GACTCCCATG 3400 
5g TCCTTCACAG CAGACTGGTG AGAACTCCCA ACATTATCCC CTTTATCCGC 3450 

GTAACTGGTA AGACACCCAT ACTCCCAGGA AGACACCATC ACTTCCTCTA 3500 
60 ACTCCTTGAC CCAATGACTA TTCTTCCCAT ATTGTCCCCA CCTACTGATC 3550 
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ACACTCTCTG ACAAGAATTA TTCTTCACAA TACAGCCCGC ATTTAAAAGC 3600 
5 TCTCGTCTAG AGATAGTACT CATGGAGGAC TAGCCTGCTT ATTAGGCTAC 3 650 
CATAGCTCTC TCTATTTCAG CTCCCTTWC CCCCCACCAA TCTTTTTCAA 3700 

1 0 

CAGAGCCAGT GCCCAGAGGT TCACCCTTTG CCTACACC TO TCCTGCTGCC 3750 
1 s TGCTGTGGAC TTTAGCTTGG GAGAATGGAA AACCCAGATG GTAAGAAAGC 3800 
CATCCCTAAC CTTGGCTTCC CTAAGTCCTG TCTTCAGTTT CCCACTGCTT 3850 
20 CCCATGGATT CTCCAACATT CTTGAGCTTT TTAAAAATAT CTCACCrTCA 3900 



25 



GCTTGGCCAC CCTAACCCAA TCTACATTCA CCTATGATGA TAGCCTGTGG 3950 
ATAAGATGAT GGCTTGCAGG TCCAATATGT GAATAGATTT GAAGCTGAAC 4000 
3 0 ACCATGAAAA GCTGGAGAGA AATCGCTCAT GGCCA^CCT TTGACCTATT 4050 
CCYGTTCAGT CTTCTTAAAT TGGCATGAAG AAGCAAGACT CATATGTCAT 4100 
3 5 CCACAGATGA CACAAAGCTG GGAAGTACCA CTAAAATAAC AAAAGACTGA 4X50 
" ATCAAGATTC AAATCACTGA AAGACTAGGT CAAAAACAAG GTGAAACAAC 4200 

40 

AGAGATATAA ACTTCTACAT GTGGGCCGGG GGCTCACGCC TGTAATCCCA 4250 
4 5 GCACTTTGGG AGGCCGAGGC AGGCAGATCA CCTQAGGGCA GGAGTTTGAG 4300 

AGCAGCCTGG CCAACATGGC GAAACCCCGT CTCTACTAAG AATACAAAAT 4350 
5 0 TAGCCGGGCA TGGTAGTGCA TGCCTGTAAT CCCAGCTACT TGGAAGGCTG 4400 

AACCACQAGA ATCCCTTGAA CCCAGGAGGT GGAGGTCGTA GTGAGCTGAG 4450 

5 5 

ATCATGCCAA TGCACTCCAG CCTGGGTGAC AAGAGCAAAA CTCCGTCTCA 4500 



6 0 AAAAGAAAAA ATGTGTAAAT TAATGAGTAA AGTCCTATTC 



4550 
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CAGCTTTCAG GCCACAATGC CCTGCTTCCA TCATTTAAGC CTCTGGCCCT 4600 
g AGCACrrCCT ACGAAAAGGA TCTGAGAGAA TTAAATTGCC CCCAAACTTA 4650 
CCATGTAACA TTACTGAAGC TGCTATTCTT AAAGCTAGTA ATTCTTGTCT 4700 
1 0 GTTTGATGTT TAGCATCCCC ATTGTGGAAA TGCTCGTACA GAACTCTATT 4750 
CCGAGTGGAC TACACTTAAA TATACTGGCC TGAACACCGG ACATCCCCCT 4800 

1 5 

GAAGACATAT GCTAATTTAT TAAGAGGGAC CATATTAAAC TAACATGTGT 4850 
2q CTAGAAAGCA GCAGCCTGAA CAGAAAGAGA CTAGAAGCAT GTTTTATGGG 4900 

CAATAGTTTA AAAAACTAAA ATOTATCCTC AAGAACCCTA GCGTCCCTTC 4950 
2 5 TTCCTTCAGG ACTGAGTCAG GGAAGAAGGG CAGTTCCTAT GGGTCCCTTC 5000 
TAGTCCTTTC TTTTCATCCT TATGATCATT ATGGTAGAGT CTCATACCTA 5050 

30 

CATTTAGTTT ATTTATTATT ATTATTTGAG ACGGAGTCTC ACTCTATCCC 5100 
35 CCAGGCTGGA GTGCAGTGGC ATGATCTCAA CTCACTGCAA CCTCAGCCTC 5150 

CCGGATTCAA GCGATTCTCC TGCCTCAGTC TCCCAAGTAG CTGGGATTAC 5200 
40 AGGTGCCCAC CACCATGCCC AGCTAATTTT TGTATTTTTG GTAGAGATGG S250 

GGTTTCACCA TGTTGGCCAG GCTGATCTTG AACTCCTGAC CTCAGGTGAT 5300 

45 

CCACCTGCCT CAGCCTCCCA AAGTGCTGGG ATTACAGGCG TGAGCCACTG 5350 
50 CACCCAGCCT TCATTCAGTT TAAAAATCAA ATGATCCTAA GGTTTTGCAG 5400 

CAGAAAGAOT AAATTTGCAG CACTAGAACC AAGAGGTAAA AGCTGTAACA 5450 
5 5 GGGCAGATTT CAGCAACGTA AGAAAAAAGG AGCTCTTCTC ACTGAAACCA 5500 
AGTGTAAGAC CAGGCTGGAC TAGAGGACAC GGGAGTTTTT GAAGCAGAGG 5550 

60 

CTGATGACCA GCTGTCGGGA GACTGTGAAG GAATTCCTGC CCTGGGTGGG 5600 

-158- 
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ACCTTGGTCC TGTCCAGTTC TCAGCCTGTA TGATTCACTC TGCTGGCTAC 5650 

5 

TCCTAAGGCT CCCCACCCGC TTTTAGTGTG CCCTTTGAGG CAGTGCGCTT 5700 
^ ^ CTCTCTTCCA TCTCTTTCTC AGGAGGAGAC CAAGGCACAG GACATTCTGG 5750 

GAGCAGTGAC CCTTCTGCTG GAGGGAGTGA TGGCAGCACG GGGACAACTG 5800 
1 5 GGACCCACTT GCCTCTCATC CCTCCTGGGG CAGCTTTCTG GACAGGTCCG 5850 

TCTCCTCCTT GGGGCCCTGC AGAGCCTCCT TGGAACCCAG GTAAGTCCCC 5900 

20 

AGTCAAGGGA TCTGTAGAAA CTGTTCTTTT CTGACTCAGT CCCACTAGAA 5950 
25 GACCTGAGGG AAGAAGGGCT CTTCCAGGGA GCTCAAGGGC AGAAGAGCTG 6000 

ATCTACTAAG AGTGCTCCCT GCCAGCCACA ATGCCTGGGT ACTGGCATCC 6050 
3 0 TGTCTTTCCT ACTTAGACAA GGGAGGCCTG AGATCTGGCC CTGGTGTTTG 6100 

GCCTCAGGAC CATCCTCTGC CCTCAGCTTC CTCCACAGGG CAGGACCACA 6150 

35 

GCTCACAAGG ATCCCAATGC CATCTTCCTG AGTTTCCAAC ACCTGCTCCG 6200 
4Q AGGAAAGGTG CGTTTCCTGA TGCTTGTAGG AGGGTCCACC CTCTGCGTCA 6250 

GGCGGGCCCC ACCCACCACA GCTGTCCCCA GCAGAACCTC TCTAGTCCTC 6300 
4 5 ACACTGAACG AGCTCCCAAA CAGGACTTCT GGATTGTTGG AGACAAACTT 6350 

CACTGCCTCA GCCAGAACTA CTGGCTCTGG GCTTCTGAAG TGGCAGCAGG 6400 

50 

GATTCAOAGC CAAGATTCCT GGTCTGCTGA ACCAAACCTC CAGGTCCCTG 6450 
55 GACCAAATCC CCGGATACCT GAACAGGATA CACGAACTCT TGAATGGAAC 6500 

TCGTGGACTC TTTCCTGGAC CCTCACGCAG GACCCTAGGA GCCCCGGACA 6550 
60 TTTCCTCAGG AACATCAGAC ACAGGCTCCC TGCCACCCAA CCTCCAGCCT 6600 

-159- 
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GGATATTCTC CTTCCCCAAC CCATCCTCCT ACTGGACAGT ATACGCTCTT 6650 
5 CCCTCTTCCA CCCACCTTGC CCACCCCTGT GGTCCAGCTC CACCCCCTGC 6700 
TTCCTGACCC TTCTGCTCCA ACGCCCACCC CTACCAGCCC TCTTCTAAAC 6750 

1 0 

ACATCCTACA CCCACTCCCA GAATCTGTCT CAGGAAGGGT AAGGTTCTCA 6800 

1 5 GACACT GCCG ACATCAGCAT TGTCTCATGT ACAGCTCCCT TCCCTGCAGG 6850 

GCGCCCCTGG GAGACAACTG GACAAGATTT CCTACTTTCT CCTGAAACCC 6900 

2 0 AAAGCCCTGG TAAAAGGGAT ACACAGGACT GAAAAGGGAA TCATTTTTCA 6950 

CTGTACATTA TAAACCTTCA GAAGCTATTT TTTTAAGCTA TCAGCAATAC 7000 

25 

TCATCAGAGC AGCTAGCTCT TTGGTCTATT TTCTGCAGAA ATTTGCAACT 7050 
30 CACTGATTCT CTACATGCTC TTTTTCTGTG ATAACTCTGC AAAGGCCTGG 7100 

GCTGGCCTGG CAGTTGAACA GAGGGAGAGA CTAACCTTGA GTCAGAAAAC 7150 
3 5 AGAGAAAGGG TAATTTCCTT TGCTTCAAAT TCAAGGCCTT CCAACGCCCC 7200 
CATCCCCTTT ACTATCATTC TCAGTGGGAC TCTGATCCCA TATTCTTAAC 7250 

40 

AGATCTTTAC TCTTGAGAAA TGAATAAGCT TTCTCTCAGA AATGCTGTCC 7300 
45 CTATACACTA GACAAAACTG AGCCTGTATA AGGAATAAAT GGGAGCGCCG 7350 

AAAAGCTCCC TAAAAAGCAA GGGAAAGATG TTCTTCGAGG GTGGCAATAG 7400 
5 0 ATCCCCCTCA CCCTGCCACC CCAAACAAAA AAGCTAACAG GAAGCCTTGG 7450 

AGAGCCTCAC ACCCCAGGTA AGGCTGTGTA GACAGTTCAG TAAAGACAGG 7500 

55 

ACCTGGATGT GACAGCTGAG CAAACAGCTA GAGCTTTGGC AGCTCAGCAG 7550 



60 



GAGGCTTTGC CAGGCATGGA CGCCTGCCTC CCTCCTGTGG AGGTCAGGAG 7600 
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GAAGTGCAGO AAGTGGCATG AGTCAGGCTC CTTGAGCTCA CACAGCAGGA 7650 
GAACAAGTAC AAGTCAAGTA CAAGTTGAAG GCTCATTTCC CAGTTCCCGC 7700 
AAATGCATCT AAAAAGCAGC TCTGTGTGAC CACCATAAAC TCTGCTAGGG 7750 
1 0 GATCTCTAAA AAGGAGTCAG GCTTATGGGG CTTTGCAAAT AAGTGCTGCC 7800 
TTGGTGCTCA GGAAAAGGTT TGTGTTGCAC AAAACACAAA TTCCACTGC 7849 

1 5 

(2) INFORMATION FOR SEQ ID NO: 12: 

SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1443 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

2 5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

GAGTCCTTGG CCCACCTCTC TCCCACCCGA CTCTGCCGAA AGAAGCACAG 50 

30 

AAGCTCAAGC CGCCTCCATG GCCCCAGGAA AGATTCAGGG GAGAGGCCCC 100 
ATACAGGGAG CCACTTCAGT TAGACACCCT GGCCAGAATG GAGCTGACTG 150 
ATTTGCTCCT GGCGGCCATG CTTCTTGCAG TGGCAAGACT AACTCTGTCC 200 

4 0 AGCCCCGTAG CTCCTGCCTG TGACCCCAGA CTCCTAAATA AACTGCTGCG 250 

TGACTCCCAC CTCCTTCACA GCCGACTGAG TCAGTGTCCC GACGTCGACC 300 

45 

CTTTGTCTAT CCCTGTTCTG CTGCCTGCTG TGGACTTTAG CCTGGGAGAA 350 
TGGAAAACCC AGACGGAACA GAGCAAGGCA CAGGACATTC TAGGGGCAGT 400 

5 0 

GTCCCTTCTA CTGGAGGGAG TGATGGCAGC ACGAGGACAG TTGGAACCCT 450 
5 5 CCTGCCTCTC ATCCCTCCTG GGACAGCTTT CTGGGCAGGT TCGCCTCCTC 500 
TTGGGGGCCC TGCAGGOCCT CCTAGGAACC CAGGGCAGGA CCACAGCTCA 550 

60 

CAAGGACCCC AATGCCCTCT TCTTGAGCTT GCAACAACTG CTTCGGGGAA 600 

-161- 



(i) 

20 
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AGGTGCGCTT CCTGCTTCTG GTAGAARCtt n/-.™-. „ 

^ iA(j AAGGTC CCACCCTCTG TGTCAGACGG 650 

5 

ACCCTGCCAA CCACAGCTGT CCCAAGCAGT ACTTCTCAAC TCCTCACACT 700 
1 o AAACAAGTTC CCAAACAGGA CTTCTGGATT GTTGGAGACG AACTTCAGTG 750 

TCACAGCCAG AACTGCTGGC CCTGGACTTC TGAGCAGGCT TCAGGGATTC 800 
1 5 AGAGTCAAGA TTACTCCTGG TCAGCTAAAT CAAACCTCCA GGTCCCCAGT 850 
CCAAATCTCT GGATACCTGA ACAGGACACA CGGACCTGTG AATGGAACTC 900 

20 

ATGGGCTCTT TGCTGGAACC TCACTTCAGA CCCTGGAAGC CTCAGACATC 950 
25 TCGCCCGGAG CTTTCAACAA AGGCTCCCTG GCATTCAACC TCCAGGGTGG 1000 

ACTTCCTCCT TCTCCAAGCC TTGCTCCTGA TGGACACACA CCCTTCCCTC 1050 
30 CTTCACCTGC CTTGCCCACC ACCCATGGAT CTCCACCCCA GCTCCACCCC 1100 

CTGTTTCCTG ACCCTTCCAC CACCATGCCT AACTCTACCG CCCCTCATCC 1150 

35 

AGTCACAATG TACCCTCATC CCAGGAATTT GTCTCAGGAA ACATAGCGCG 1200 
40 SGCACTGGCC CAGTGAGCGT CTGCAGCTTC TCTCGGGGAC AAGCTTCCCC 1250 

AGGAAGGCTG AGAGGCAGCT GCATCTGCTC CAGATGTTCT GCTTTCACCT 1300 
45 AAAAGGCCCT GGGGAAGGGA TACACAGCAC TGGAGATTGT AAAATTTTAQ 1350 
GAGCTATTTT TTTTTAACCT ATCAGCAATA TTCATCAGAG CAGCTAGCGA 1400 

50 

TCTTTGGTCT ATTTTCGGTA TAAATTTGAA AATCACTAAT TCT 1443 

55 (2) INFORMATION FOR SEQ ID NO:13: 

(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 352 amino acids 
IB) TYPE: amino acid 
60 (D) TOPOLOGY: linear 



1 62- 
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1 0 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

Met Glu Leu Thr Asp Leu Leu Leu Ala Ala Met Leu Leu Ala Val 
15 10 15 

Ala Arg Leu Thr Leu Ser Ser Pro Val Ala Pro Ala Cys Asp Pro 
20 25 30 

Arg Leu Leu Asn Lys Leu Leu Arg Asp Ser His Leu Leu His Ser 
3 5 40 45 

Arg Leu Ser Gin Cys Pro Asp Val Asp Pro Leu Ser He Pro Val 
5 0 55 60 

1 5 Leu Leu Pro Ala Val Asp Phe Ser Leu Gly Glu Trp Lys Thr Gin 

65 70 75 

Thr Glu Gin Ser Lys Ala Gin Asp He Leu Gly Ala Val Ser Leu 
20 80 85 90 

Leu Leu Glu Gly Val Met Ala Ala Arg Gly Gin Leu Glu Pro Ser 
9 5 100 105 

Cys Leu Ser Ser Leu Leu Gly Gin Leu Ser Gly Gin Val Arg Leu 
25 110 H5 120 

Leu Leu Gly Ala Leu Gin Gly Leu Leu Gly Thr Gin Gly Arg Thr 
125 130 135 

3 0 Thr Ala His Lys Asp Pro Asn Ala Leu Phe Leu Ser Leu Gin Gin 

140 145 150 



35 



Leu Leu Arg Gly Lys Val Arg Phe Leu Leu Leu Val Glu Gly Pro 
155 160 165 

Thr Leu Cys Val Arg Arg Thr Leu Pro Thr Thr Ala Val Pro Ser 
170 175 iso 

Ser Thr Ser Gin Leu Leu Thr Leu Asn Lys Phe Pro Asn Arg Thr 
40 185 190 195 

Ser Gly Leu Leu Glu Thr Asn Phe Ser Val Thr Ala Arg Thr Ala 
200 205 210 

45 Gly Pro Gly Leu Leu Ser Arg Leu Gin Gly Phe Arg Val Lys He 

215 220 225 

Thr Pro Gly Gin Leu Asn Gin Thr Ser Arg Ser Pro Val Gin He 
50 230 235 2 <0 

Ser Oly Tyr Leu Asn Arg Thr His Gly Pro Val Asn Gly Thr His 
245 250 255 

Gly Leu Phe Ala Gly Thr Ser Leu Gin Thr Leu Glu Ala Ser Asd 
55 260 265 270 

He Ser Pro Gly Ala Phe Asn Lys Gly Ser Leu Ala Phe Asn Leu 
275 280 285 

60 Gin Gly Gly Leu Pro Pro Ser Pro Ser Leu Ala Pro Asp Gly His 

2 *° 295 300 
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Thr Pro Phe Pro Pro Ser Pro Ala Leu Pro Thr Thr His Gly Ser 
305 310 315 

5 Pro Pro Gin Leu His Pro Leu Phe Pro Asp Pro Ser Thr Thr Met 
320 325 330 

Pro Asn Ser Thr Ala Pro His Pro Val Thr Met Tyr Pro His Pro 
1Q 33 5 340 345 

Arg Asn Leu Ser Gin Glu Thr 
350 352 



1 5 



20 



30 



35 



(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1536 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 
2 5 GAGTCCTTGG CCCACCTCTC TCCCACCCGA CTCTGCCGAA AGAAGCACAG 50 
AAGCTCAAGC CGCCTCCATG GCCCCAGGAA AGATTCAGGG GAGAGGCCCC 100 
ATACAGGGAG CCACTTCAGT TAGACACCCT GGCCAGAATG GAGCTGACTG 150 
ATTTGCTCCT GGCGGCCATG CTTCTTGCAG TGGCAAGACT AACTCTGTCC 200 
AGCCCCGTAG CTCCTGCCTG TGACCCCAGA CTCCTAAATA AACTGCTGCG 250 

4 0 TGACTCCCAC CTCCTTCACA GCCGACTGAG TCAGTGTCCC GACGTCGACC 300 

CTTTGTCTAT CCCTGTTCTG CTGCCTGCTG TGGACTTTAO CCTGGGAGAA 350 

45 

TGGAAAACCC AGACGGAACA GAGCAAGGCA CAGGACATTC TAGGGGCAGT 400 
5Q GTCCCTTCTA CTGGAGGGAG TGATGGCAGC ACGAGGACAG TTGGAACCCT 450 
CCTOCCTCTC ATCCCTCCTG GGACAGCTTT CTGGGCAGGT TCGCCTCCTC 500 

5 5 TTGGGGGCCC TGCAGGGCCT CCTAGGAACC CAGCTTCCTC TACAGGGCAG 550 

GACCACAGCT CACAAGGACC CCAATGCCCT CTTCTTGAGC TTGCAACAAC 600 

60 

TGCTTCGGGG AAAGGTGCGC TTCCTGCTTC TGGTAGAAGG TCCCACCCTC 650 
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1 0 



20 



25 



TGTGTCAGAC GGACCCTGCC AACCACAGCT GTCCCAAGCA GTACTTCTCA 700 
ACTCCTCACA CTAAACAAGT TCCCAAACAG GACTTCTGGA TTGTTGGAGA 750 
CGAACTTCAG TGTCACAGCC AGAACTGCTG GCCCTGGACT TCTGAGCAGG 800 
CTTCAGGGAT TCAGAGTCAA GATTACTCCT GGTCAGCTAA ATCAAACCTC 850 
1 5 CAGGTCCCCA GTCCAAATCT CTGGATACCT GAACAGGACA CACGGACCTG 900 
TGAATGGAAC TCATGGGCTC TTTGCTGGAA CCTCACTTCA GACCCTGGAA 950 
GCCTCAGACA TCTCGCCCGG AGCTTTCAAC AAAGGCTCCC TGGCATTCAA 1000 
CCTCCAGGGT GGACTTCCTC CTTCTCCAAG CCTTGCTCCT GATGGACACA 1050 
CACCCTTCCC TCCTTCACCT GCCTTGCCCA CCACCCATGG ATCTCCACCC 1100 

3 0 CAGCTCCACC CCCTGTTTCC TGACCCTTCC ACCACCATGC CTAACTCTAC 1150 

CGCCCCTCAT CCAGTCACAA TGTACCCTCA TCCCAGGAAT TTGTCTCAGG 1200 
AAACATAGCG CGGGCACTGG CCCAGTGAGC GTCTGCAGCT TCTCTCGGGG 1250 
ACAAGCTTCC CCAGGAAGGC TGAGAGGCAG CTGCATCTGC TCCAGATGTT 1300 
CTGCTTTCAC CTAAAAGGCC CTGGGGAAGG GATACACAGC ACTGGAGATT 1350 

4 5 GTAAAATTTT AGGAGCTATT TTTTTTTAAC CTATCAGCAA TATTCATCAG 1400 

AGCAGCTAGC GATCTTTGGT CTATTTTCGG TATAAATTTG AAAATCACTA 1450 

50 

AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA 1500 
AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAA 1536 " 

5 5 



(2) INFORMATION FOR SEQ ID NO: 15: 

6 0 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 356 amino acids 



35 



40 
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(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:15- 

5 

Met Glu Leu Thr Asp Leu Leu Leu Ala Ala Met Leu Leu Ala Val 
5 10 15 

1 q Ala Arg Leu Thr Leu Ser Ser Pro Val Ala Pro Ala Cys Asp Pro 

20 25 30 

Arg Leu Leu Asn Lys Leu Leu Arg Asp Ser His Leu Leu His Ser 
3 5 40 45 



1 5 Arg Leu Ser Gin 



Cys Pro Asp Val Asp Pro Leu Ser He Pro Val 
50 55 60 

Leu Leu Pro Ala Val Asp Phe Ser Leu Gly Glu Trp Lys Thr Gin 
20 " 70 75 

Thr Glu Gin Ser Lys Ala Gin Asp lie Leu Gly Ala Val Ser Leu 
80 85 90 

Leu Leu Glu Gly Val Met Ala Ala Arg Gly Gin Leu Glu Pro Ser 
" 95 100 105 

Cys Leu Ser Ser Leu Leu Gly Gin Leu Ser Gly Gin Val Arg Leu 
110 US 120 

3 0 Leu Leu Gly Ala Leu Gin Gly Leu Leu Gly Thr Gin Leu Pro Leu 

125 130 135 

Gin Gly Arg Thr Thr Ala His Lys Asp Pro Asn Ala Leu Phe Leu 
35 140 145 «0 

Ser Leu Gin Gin Leu Leu Arg Gly Lys Val Arg Phe Leu Leu Leu 
155 160 165 

Val Glu Gly Pro Thr Leu Cys Val Arg Arg Thr Leu Pro Thr Thr 
170 17S 180 

Ala val Pro Ser Ser Thr Ser Gin Leu Leu Thr Leu Asn Lys Phe 
185 190 195 

45 Pro Asn Arg Thr Ser Gly Leu Leu Glu Thr Asn Phe Ser Val Thr 

200 2 05 210 

Ala Arg Thr Ala Gly Pro Gly Leu Leu Ser Arg Leu Gin Gly Phe 
50 215 220 2 « 

Arg Val Lys He Thr Pro Gly Gin Leu Asn Gin Thr Ser Arg Ser 
230 2 35 240 

Pro Val Gin He Ser Gly Tyr Leu Asn Arg Thr His Gly Pro Val 
. 245 250 255 

Asn Gly Thr His Gly Leu Phe Ala Gly Thr Ser Leu Gin Thr Leu 
260 265 270 

60 Glu Ala Ser Asp lie Ser Pro Gly Ala Phe Asn Lys Gly Ser Leu 

275 280 285 



40 
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Ala Phe Asn Leu Gin G] v rn, « 

oon 7 y ^ eu Pro Pro Ser Pro Ser Leu Ala 

290 295 3 oo 

5 Pro Asp Gly His Thr Pro Phe Pro Pro Ser Pro Ala Leu Pro Thr 

305 310 315 

Thr His Gly Ser Pro Pro Gin Leu His Pro Leu Phe Pro Asp Pro 

1 0 320 325 330 

Ser Thr Thr Met Pro Asn Ser Thr Ala Pro His Pro Val Thr Met 

335 340 345 



1 5 



25 



30 



Tyr Pro His Pro Arg Asn Leu Ser Gin Glu Thr 
350 355 356 

(2) INFORMATION FOR SEQ ID NO: 16: 



(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 241 amino acids 

(B) TYPE: amino acid 
(DJ TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

Ser Pro Val Ala Pro Ala Cys Asp Pro Arg Leu Leu Asn Lys Leu 
1 5 10 15 

Leu Arg Asp Ser His Leu Leu His Ser Arg Leu Ser Gin Cys Pro 
20 25 30 

Asp Val Asp Pro Leu Ser lie Pro Val Leu Leu Pro Ala Val- Asp 
3 5 40 45 

3 5 Phe Ser Leu Gly Glu Trp Lys Thr Gin Thr Glu Gin Ser Lys Ala 

50 55 60 

Gin Asp He Leu Gly Ala Val Ser Leu Leu Leu Glu Gly Val Met 
40 " 70 7 5 

Ala Ala Arg Gly Gin Leu Glu Pro Ser Cys Leu Ser Ser Leu Leu 
80 85 9 o 

Gly Gin Leu Ser Gly Gin Val Arg Leu Leu Leu Gly Ala Leu Gin 
4& 95 100 io5 

Gly Leu Leu Gly Thr Gin Leu Pro Leu Gin Gly Arg Thr Thr Ala 
110 U5 120 

5 0 His Lys Asp Pro Asn Ala Leu Phe Leu Ser Leu Gin Gin Leu Leu 

125 130 135 

Arg Gly Lys Asp Phe Trp He Val Gly Asp Glu Leu Gin Cys His 
55 140 145 

Ser Gin Asn Cys Trp Pro Trp Thr Ser Glu Gin Ala Ser Gly He 
15 5 160 i 6 5 

Gin Ser Gin Asp Tyr Ser Trp Ser Ala Lys Ser Asn Leu Gin Val 
60 170 175 180 
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15 



50 



Pro Ser Pro Asn Leu Trp lie Prn rv, * 

185 P hr Thr ^ 



190 



195 



Glu Trp Asn Ser Trp Ala Leu Cys Trp Asn L eu Th r Ser Asp Pro 



205 



210 



Gly Ser Leu Arg His Leu Ala Arg Ser Phe Gin Gin Arg Leu Pro 

215 220 2 2 5 

1 0 Gly lie G m Pro P Gly Trp Thr ^ ^ 

230 235 2 40 



Ser 
241 



(2) INFORMATION FOR SEQ ID NO: 17: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 335 amino acids 
20 <B> TYPE: amino acid 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

2 5 ser Pro Val Ala Pro Ala Cys Asp Pro Arg Leu Leu Asn Lys Leu 
1 5 10 



15 



Leu Arg Asp Ser His Leu Leu His Ser Arg Leu Ser Gin Cys Pro 



30 25 30 

Asp Val Asp Pro Leu Ser lie Pro Val Leu Leu Pro Ala Val Asp 

35 <° 45 

Phe Ser Leu Gly Glu Trp Lys Thr Gin Thr Glu Gin Ser Lys Ala 

50 55 go 

Gin Asp He Leu Gly Ala Val Ser Leu Leu Leu Glu Gly Val Met 

65 70 75 



4 0 Ala Ala Arg Gly Gin Leu Glu 



80 85 



Pro Ser Cys Leu Ser Ser Leu Leu 

90 



Gly Gin Leu Ser Gly Gin Val Arg Leu Leu Leu Gly Ala Leu Gin 
45 5 100 105 

Gly Leu Leu Gly Thr Gin Leu Pro Leu Gin Gly Arg Thr Thr Ala 

110 us 120 

His Lys Asp Pro Asn Ala Leu Phe Leu Ser Leu Gin Gin Leu Leu 
125 130 135 

Arg Gly Lys Val Arg Phe Leu Leu Leu Val Glu Gly Pro Thr Leu 
140 "5 150 

5 5 cys Val Arg Arg Thr Leu Pro Thr Thr Ala Val Pro Ser Ser Thr 

155 160 165 

Ser Gin Leu Leu Thr Leu Asn Lys Phe Pro Asn Arg Thr Ser Gly 
60 70 175 180 
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Leu Leu Glu Thr Asn Phe Ser Val Thr Ala Arg Thr Ala Gly Pro 
85 190 195 

Gly Leu Leu Ser Arg Leu Gin Gly Phe Arg Val Lys Ile Thr pro 

205 210 



Gly Gin Leu Asn Gin Thr Ser Arg Ser Pro Val Gin lie Ser Gly 
215 220 22 5 

1 0 Tyr Leu Asn Arg Thr His Gly Pro Val Asn Gly Thr His Gly Leu 
230 235 240 

Phe Ala Gly Thr Ser Leu Gin Thr Leu Glu Ala Ser Asp lie Ser 
15 245 250 255 

Pro Gly Ala Phe Asn Lys Gly Ser Leu Ala Phe Asn Leu Gin Gly 
250 265 21 l 

Gly Leu Pro Pro Ser Pro Ser Leu Ala Pro Asp Gly His Thr Pro 
275 280 285 

Phe Pro Pro Ser Pro Ala Leu Pro Thr Thr His Gly Ser Pro Pro 
290 295 300 

2 5 cm l^u „ is Pro Leu Phe Pro Asp Pro Ser Thr Thr Met Pro Asn 



20 



35 



40 



50 



310 



315 



Ser Thr Ala Pro His Pro Val Thr Met Tyr Pro His Pro Arg Asn 
30 325 330 

Leu Ser Gin Glu Thr 
335 



(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 332 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

Ser Pro Ala Pro Pro Ala Cys Asp Pro Arg Leu Leu Asn Lys Leu 
45 10 15 

Leu Arg Asp Ser His Val Leu His Gly Arg Le U ser Gin Cys Pro 



20 9c 



Asp Ile Asn Pro Leu Ser Thr Pro Val Leu Leu Pro Ala Val Asp 



40 



45 



Phe Thr Leu Gly Glu Trp Lys Thr Gin Thr Glu Gin Thr Lys Ala 

55 60 

5 5 Gin Asp Val Leu Gly Ala Thr Thr Leu Leu Leu Glu Ala Val Met 

65 ™ 75 



Thr Ala Arg Gly Gin Val Gly Pro Pro Cys Leu Ser 



80 oc 

60 85 90 



Ser Leu Leu 



-169- 



WO 95/18858 



PCT/US94/14553 



Val Gin Leu Ser Glv r,ln - * 

fcly Gin Vai Arg Leu Leu Leu Gly Ala Leu Gin 

100 105 

Asp Leu Leu Gly Met Gin Leu Pro Pro Gln Gly ^ Thr Thf ^ 

U0 H5 120 

His Lys Asp Pro Ser A!a lie ?he Leu Asn Phe Gin Gin Leu Leu 

125 "a 135 



1 0 Arg Gly Lys Val Arg Phe Leu Leu Leu ^ ^ Qly ^ ^ 



140 



145 



Leu 
150 



Cys Ala Lys Arg Ala Pro Pro Ala lie Ala Val Pro Ser Ser Thr 
15 155 ISO 165 



20 



Ser Pro Phe His Thr Leu Asn Lys Leu Pro Asn Arg Thr Ser Gly 
170 "5 18 J 

Leu Leu Glu Thr Asn Ser Ser lie Ser Ala Arg Thr Thr Gly Sar 
185 19° 195 

Gly Phe Leu Lys Arg Leu Gin Ala Phe Arg Ala Lys Il e Pro Gly 
200 205 210 

2 5 Leu Leu Asn Gin Thr Ser Arg Ser Leu Asp Gin lie Pro Gly His 

215 220 225 



Gin Asn Gly Thr His Gly Pro Leu Ser Gly He His Gly Leu Phe 
30 230 "5 2 40 



Pro Gly Pro Gin Pro Gly Ala Leu Gly Ala Pro Asp lie 



250 



Pro Pro 
255 



Ala Thr Ser Gly Met Gly Ser Arg Pro Thr Tyr Leu Gin 



3 5 * " * ro Tnr TVT Leu Gin Pro Gly 

265 270 

Glu Ser Pro Ser Pro Ala His Pro Ser Pro Gly Arg Tyr Thr Leu 



280 



4 0 Phe Ser 



285 



Pro Ser Pro Thr Ser Pro Ser Pro Thr Val Gin Leu Gin 



295 



300 



Pro Leu Leu Pro Asp Pro Ser Ala He Thr Pro Asn Ser Thr Ser 
45 305 3" 315 

Pro Leu Leu Phe Ala Ala His Pro His Phe Gin Asn Leu Ser Gin 



320 -inc 

325 330 



Glu Glu 
5 0 332 



(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS • 
55 (A) LENGTH: 1026 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

60 (xi) SEQUENCE DESCRIPTION : SEQ ID NO: 19: 
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AGCCCGGCTC CTCCTCCCTG TGACCCCCGA CTCCTAAATA AACTGCTTCG 50 
5 TGACTCCCAT. GTCCTTCACG GCAGACTGAG CCAGTGCCCA GACATTAACC 100 
CTTTGTCCAC ACCTGTCCTG CTGCCTGCTG TGGACTTCAC CTTGGGAGAA 150 

10 

TGGAAAACCC AGACGGAGCA GACAAAGGCA CAGGATGTCC TGGGAGCCAC 200 
AACCCTTCTG CTGGAGGCAG TGATGACAGC ACGGGGACAA GTGGGACCCC 250 

1 5 

CTTGCCTCTC ATCCCTGCTG GTGCAGCTTT CTGGACAGGT TCGCCTCCTC 300 

2 0 CTCGGGGCCC TGCAGGACCT CCTTGGAATG CAGCTTCCTC CACAGGGAAG 350 

GACCACAGCT CACAAGGATC CCAGTGCCAT CTTCCTGAAC TTCCAACAAC 400 

25 

TGCTCCGAGG AAAGGTGCGT TTCCTGCTCC TTGTAGTGGG GCCCTCCCTC 450 
TGTGCCAAGA GGGCCCCACC CGCCATAGCT GTCCCGAGCA GCACCTCTCC 500 

30 

ATTCCACACA CTGAACAAGC TCCCAAACAG GACCTCTGGA TTGTTGGAGA 550 

3 5 CAAACTCCAG TATCTCAGCC AGAACTACTG GCTCTGGATT TCTCAAGAGG 600 

CTGCAGGCAT TCAGAGCCAA GATTCCTGGT CTGCTGAACC AAACCTCCAG 650 

40 

GTCCCTAGAC CAAATCCCTG GACACCAGAA TGGGACACAC GGACCCTTGA 700 
GTGGAATTCA TGGACTCTTT CCTGGACCCC AACCCGGGGC CCTCGGAGCT 750 

4 5 

CCAGACATTC CTCCAGCAAC TTCAGGCATG GGCTCCCGGC CAACCTACCT 800 

5 0 CCAGCCTGGA GAGTCTCCTT CCCCAGCTCA CCCTTCTCCT GGACGATACA 850 

CTCTCTTCTC TCCTTCACCC ACCTCGCCCT CCCCCACAGT CCAGCTCCAG 900 

55 

CCTCTGCTTC CTGACCCCTC TGCGATCACA CCCAACTCTA CCAGTCCTCT 950 
TCTATTTGCA GCTCACCCTC ATTTCCAGAA CCTGTCTCAG GAAGAGTAAG 1000 

60 

-1 71 - 
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GTGCTCAGAC CCTGCCAAC? TCAGCA 1026 



5 (2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS • 
(A) LENGTH : 1014 bases 

10 lr\ TYPE: nucl ^ic acid 

1° C STRANDE3NESS : single 

(D) TOPOLOGY: linear 



1 5 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:20: 
AGCCCGGCTC CTCCTGCCTG TGACCCCCGA CTCCTAAATA AACTGCTTCG 50 
2q TGACTCCCAT GTCCTTCACG GCAGACTGAG CCAGTGCCCA GACATTAACC 100 
CTTTGTCCAC ACCTGTCCTG CTGCCTGCTG TGGACTTCAC C TTGGG AGAA 150 
2 5 TGGAAAACCC AGACGGAGCA GACAAAGGCA CAGGATGTCC TGGGAGCCAC 200 
AACCCTTCTG CTGGAGGCAG TGATGACAGC ACGGGGACAA GTGGGACCCC 250 

30 

CTTGCCTCTC ATCCCTGCTG GTGCAGCTTT CTGGACAGGT TCGCCTCCTC 300 
^ CTCGGGGCCC TGCAGGACCT CCTTGGAATG CAGGGAAGGA CCACAGCTCA 350 

CAAGGATCCC AGTGCCATCT TCCTGAACTT CCAACAACTG CTCCGAGGAA 400 
4 0 AGGTGCGTTT CCTGCTCCTT GTAGTGGGGC CCTCCCTCTG TGCCAAGAGG 450 

GCCCCACCCG CCATAGCTGT CCCGAGCAGC ACCTCTCCAT TCCACACACT 500 

45 

GAACAAGCTC CCAAACAGGA CCTCTGGATT GTTCGAGACA AACTCCAGTA 550 
50 TCTCAGCCAG ^"GGC TCTGGATTTC TCAAGAGGCT GCAGGCATTC 600 

AGAGCCAAGA TTCCTGGTCT GCTGAACCAA ACCTCCAGGT CCCTAGACCA SS0 
5 5 AATCCCTGGA CACCAGAATG GGACACACGG ACCCTTGAGT GGAATTCATG 700 
GACTCTTTCC TGGACCCCAA CCCGGGGCCC TCGGAGCTCC AGACATTCCT 750 

60 

CCAGCAACTT CAGGCATGGG CTCCCGGCCA ACCTACCTCC AGCCTGGAGA 800 

-1 72- 
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G^TCCTTCC CCAGCTCACC CTTCTCCTOC ACGATACACT CTCrPCTC^ 850 

CrrCACCCAC ctcgccctcc cccacactcc agctccagcc ^tgcttcct soo 

1 o GACCCCTCTG CGATCACACC CAACTCTACC AGTCCTCTTC TATTTGCAGC 950 

TCACCCTCAT TTCCAGAACC TGTCTCAGGA AGAGTAAGGT GCTCAGACCC 1000 
1 5 TGCCAACTTC AGCA 1014 



20 



25 



(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 328 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

Ui) SEQUENCE DESCRIPTION: SEQ ID NO:21: 



Ser Pro Ala Pro Pro Ala Cys Asp Pro Arc Leu Leu Asn Lys Leu 
30 10 15 

Leu Arc Asp Ser His Val Leu His Gly Arg Leu Ser Gin Cys Pro 



25 



30 



35 



Asp He Asn Pro Leu Ser Thr Pro Val Leu Leu Pro Ala Val Asp 

Phe Thr Leu Gly oiu Trp Lys Thr Gin Thr Glu Gin Thr Lys Ala 

55 60 
4 0 Gin Asp Val Leu Gly Ala Thr Thr Leu Leu Leu Glu Ala Val Met 

70 ?5 

Thr Ala Arg Gly Gin Val Gly Pro Pro Cys Leu Ser Ser Leu Leu 
45 85 90 

Val Gin Leu Ser Gly Gln Val Arg Leu Leu Leu Gly Ala Leu Gin 

100 105 
Asp Leu Leu Gly Met Gin Gly Arg Thr Thr Ala His Lys Asp Pro 

115 120 
Ser Ala lie Phe Leu Asn Phe Gin Gin Leu Leu Arg Gly Lys Val 



50 



55 Arg Phe Leu 



130 135 



Leu Leu Val Val Gly Pro Ser Leu Cys Al. Lys Arg 



145 



150 



Ala Pro Pro A la lie AI. Val Pro Ser Ser Thr Ser Pro Phe His 
60 16 ° 165 
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Thr Leu Asn Lys Leu Pro Asn Arg Thr Ser Gly Leu Leu Glu Thr 
170 175 180 

Asn Ser Ser lie Ser Ala Arg Thr Thr Gly Ser Gly Phe Leu Lys 

185 190 195 

Arg Leu Gin Ala Phe Arg Ala Lys He Pro Gly Leu Leu Asn Gin 

200 205 210 

1 0 Thr Ser Arg Ser Leu Asp Gin lie Pro Gly His Gin Asn Gly Thr 

215 220 225 

His Gly Pro Leu Ser Gly lie His Gly Leu Phe Pro Gly Pro Gin 
1 5 230 "5 240 

Pro Gly Ala Leu Gly Ala Pro Asp lie Pro Pro Ala Thr Ser Gly 

245 250 255 

Met Gly ser Arg Pro Thr Tyr Leu Gin Pro Gly Glu Ser Pro Ser 
260 265 270 

Pro Ala His Pro Ser Pro Gly Arg Tyr Thr Leu Phe Ser Pro Ser 
275 280 285 

2 5 Pro Thr Ser Pro Ser Pro Thr Val Gin Leu Gin Pro Leu Leu Pro 

290 295 300 

Asp Pro Ser Ala lie Thr Pro Asn Ser Thr Ser Pro Leu Leu Phe 
30 305 310 315 

Ala Ala His Pro His Phe Gin Asn Leu Ser Gin Glu 



35 



Glu 

320 325 328 



(2) INFORMATION FOR SEQ ID NO: 22: 



(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 5141 bases 

(B) TYPE: nucleic acid 

. _ <C) STRANDEDNESS : double 

40 (D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 
4 5 TTCGAGCTCG CCCGACATTG ATTATTGACT AGAGTCGATC GACAGCTGTG SO 
GAATGTGTGT CAGTTAGGGT GTGGAAAGTC CCCAGGCTCC CCAGCAGGCA 100 

50 

GAAGTATGCA AAGCATGCAT CTCAATTAGT CAGCAACCAG GTGTGGAAAG 150 
55 TCCCCAGGCT CCCCAGCAGG CAGAAGTATG CAAAGCATGC ATCTCAATTA 200 

GTCAGCAACC ATAGTCCCGC CCCTAACTCC GCCCATCCCG CCCCTAACTC 250 
60 CGCCCAGTTC CGCCCATTCT CCGCCCCATG GCTGACTAAT TTTTTTTATT 300 
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TATGCAGAGG CCGAGGCCGC CTCGGCCTCT GAGCTATTCC AGAAGTAGTG 350 
5 AGGAGGCTTT TTTGGAGGCC TAGGCTTTTG CAAAAAGCTA GCTTATCCGG 4 00 
CCGGGAACGG TGCATTGGAA CGCGGATTCC CCGTGCCAAG AGTGACGTAA 450 

1 0 

GTACCGCCTA TAGAGCGATA AGAGGATTTT ATCCCCGCTG CCATCATGGT 500 
TCGACCATTG AACTGCATCG TCGCCGTGTC CCAAAATATG GGGATTGGCA 550 

1 0 

AGAACGGAGA CCTACCCTGG CCTCCGCTCA GGAACGAGTT CAAGTACTTC 600 

2 0 CAAAGAATGA CCACAACCTC TTCAGTGGAA GGTAAACAGA ATCTGGTGAT 650 

TATGGGTAGG AAAACCTGGT TCTCCATTCC TGAGAAGAAT CGACCTTTAA 700 

25 

AGGACAGAAT TAATATAGTT CTCAGTAGAG AACTCAAAGA ACCACCACGA 750 
GGAGCTCATT TTCTTGCCAA AAGTTTGGAT GATGCCTTAA GACTTATTGA 800 

3 0 

ACAACCGGAA TTGGCAAGTA AAGTAGACAT GGTTTGGATA GTCGGAGGCA 850 
3 5 GTTCTGTTTA CCAGGAAGCC ATGAATCAAC CAGGCCACCT TAGACTCTTT 900 
GTGACAAGGA TCATGCAGGA ATTTGAAAGT GACACGTTTT TCCCAGAAAT 950 

40 

TGATTTGGGG AAATATAAAC CTCTCCCAGA ATACCCAGGC GTCCTCTCTG 1000 
AGGTCCAGGA GGAAAAAGGC ATCAAGTATA AGTTTGAAGT CTACGAGAAG 1050 
AAAGACTAAC AGGAAGATGC TTTCAAGTTC TCTGCTCCCC TCCTAAAGCT 1100 

5 0 ATGCATTTTT ATAAGACCAT GGGACTTTTG CTGGCTTTAG ATCCCCTTGG 1150 

CTTCGTTAGA ACGCGGCTAC AATTAATACA TAACCTTATG TATCATACAC 1200 

55 

ATACGATTTA GGTGACACTA TAGATAACAT CCACTTTGCC TTTCTCTCCA 1250 
CAGGTGTCCA CTCCCAGGTC CAACTGCACC TCGGTTCTAA GCTTCTGCAG 1300 

6 0 
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GTCGACTCTA GAGGATCCCC GGGGAATTCA ATCGATGGCC GCCATCGCCC 1350 
g AACTTGTTTA TTGCAGCTTA TAATGGTTAC AAATAAAGCA ATAGCATCAC 1400 
AAATTTCACA AATAAAGCAT TTTTTTCACT GCATTCTAGT TGTGGTTTGT 1450 
1 0 CCAAACTCAT CAATGTATCT TATCATGTCT GGATCGATCG GGAATTAATT 1500 
CGGCGCAGCA CCATGGCCTG AAATAACCTC TGAAAGAGGA ACTTGGTTAG 1550 

1 5 

GTACCTTCTG AGGCGGAAAG AACCAGCTGT GGAATGTGTG TCAGTTAGGG 1600 
2q TGTGGAAAGT CCCCAGGCTC CCCAGCAGGC AGAAGTATGC AAAGCATGCA 1650 

TCTCAATTAG TCAGCAACCA GGTGTGGAAA GTCCCCAGGC TCCCCAGCAG 1700 
2 5 GCAGAAGTAT GCAAAGCATG CATCTCAATT AGTCAGCAAC CATAGTCCCG 1750 
CCCCTAACTC CGCCCATCCC GCCCCTAACT CCGCCCAGTT CCGCCCATTC 1800 

30 

TCCGCCCCAT GGCTGACTAA TTTTTTTTAT TTATGCAGAG GCCGAGGCCG 1850 
35 CCTCGGCCTC TGAGCTATTC CAGAAGTAGT GAGGAGGCTT TTTTGGAGGC 1900 

CTAGGCTTTT GCAAAAAGCT GTTACCTCGA GCGGCCGCTT AATTAAGGCG 1950 
4 0 CGCCATTTAA ATCCTGCAGG TAACAGCTTG GCACTGGCCG TCGTTTTACA 2000 

ACGTCGTGAC TGGGAAAACC CTGGCGTTAC CCAACTTAAT CGCCTTGCAG 2050 

45 

CACATCCCCC CTTCGCCAGC TGGCGTAATA GCGAAGAGGC CCGCACCGAT 2100 
50 CGCCCTTCCC AACAGTTGCG TAGCCTGAAT GGCGAATGGC GCCTGATGCG 2150 

GTATTTTCTC CTTACGCATC TGTGCGGTAT TTCACACCGC ATACGTCAAA 2200 
55 GCAACCATAG TACGCGCCCT GTAGCGGCGC ATTAAGCGCG GCGGGTGTGG 2250 
TGGTTACGCG CAGCGTGACC GCTACACTTG CCAGCGCCCT AGCGCCCGCT 2300 

60 

CCTTTCGCTT TCTTCCCTTC CTTTCTCGCC ACGTTCGCCG GCTTTCCCCG 2350 
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TCAAGCTCTA AATCGGGGGC TCCCTTTAGG GTTCCGATTT AGTGCTTTAC 2400 

5 

GGCACCTCGA CCCCAAAAAA CTTGATTTGG GTGATGGTTC ACGTAGTGGG 2450 
^ ^ CCATCGCCCT GATAGACGGT TTTTCGCCCT TTGACGTTGG AGTCCACGTT 2500 

CTTTAATAGT GGACTCTTGT TCCAAACTGG AACAACACTC AACCCTATCT 2550 
1 5 CGGGCTATTC TTTTGATTTA TAAGGGATTT TGCCGATTTC GGCCTATTGG 2600 
TTAAAAAATG AGCTGATTTA ACAAAAATTT AACGCGAATT TTAACAAAAT 2650 

20 

ATTAACGTTT ACAATTTTAT GGTGCACTCT CAGTACAATC TGCTCTGATG 2700 
2g CCGCATAGTT AAGCCAACTC CGCTATCGCT ACGTGACTGG GTCATGGCTG 2750 

CGCCCCGACA CCCGCCAACA CCCGCTGACG CGCCCTGACG GGCTTGTCTG 2800 
30 CTCCCGGCAT CCGCTTACAG ACAAGCTGTG ACCGTCTCCG GGAGCTGCAT 2850 

GTGTCAGAGG TTTTCACCGT CATCACCGAA ACGCGCGAGG CAGTATTCTT 2900 

35 

GAAGACGAAA GGGCCTCGTG ATACGCCTAT TTTTATAGGT TAATGTCATG 2950 
40 ATAATAATGG TTTCTTAGAC GTCAGGTGGC ACTTTTCGGG GAAATGTGCG 3000 

CGGAACCCCT ATTTGTTTAT TTTTCTAAAT ACATTCAAAT ATGTATCCGC 3050 
45 TCATGAGACA ATAACCCTGA TAAATGCTTC AATAATATTG AAAAAGGAAG 3100 
AGTATGAGTA TTCAACATTT CCGTGTCGCC CTTATTCCCT TTTTTGCGGC 3150 

50 

ATTTTGCCTT CCTGTTTTTG CTCACCCAGA AACGCTGGTG AAAGTAAAAG 3200 
55 ATGCTGAAGA TCAGTTGGGT GCACGAGTGG GTTACATCGA ACTGGATCTC 3250 

AACAGCGGTA AGATCCTTGA GAGTTTTCGC CCCGAAGAAC GTTTTCCAAT 3300 
60 GATGAGCACT TTTAAAGTTC TGCTATGTGG CGCGGTATTA TCCCGTGATG 3350 
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ACGCCGGGCA AGAGCAACTC GGTCGCCGCA TACACTATTC TCAGAATGAC 3400 
5 TTGGTTGAGT ACTCACCAGT CACAGAAAAG CATCTTACGG ATGGCATGAC 3450 
AGTAAGAGAA TTATGCAGTG CTGCCATAAC CATGAGTGAT AACACTCCGG 3 500 

1 0 

CCAACTTACT TCTGACAACG ATCGGAGGAC CGAAGGAGCT AACCGCTTTT 3 550 

1 g TTGCACAACA TGGGGGATCA TGTAACTCGC CTTGATCGTT GGGAACCGGA 3600 

GCTGAATGAA GCCATACCAA ACGACGAGCG TGACACCACG ATGCCAGCAG 3650 

2 0 CAATGGCAAC AACGTTGCGC AAACTATTAA CTGGCGAACT ACTTACTCTA 3700 

GCTTCCCGGC AACAATTAAT AGACTGGATG GAGGCGGATA AAGTTGCAGG 3750 

25 

ACCACTTCTG CGCTCGGCCC TTCCGGCTGG CTGGTTTATT GCTGATAAAT 3800 
CTGGAGCCGG TGAGCGTGGG TCTCGCGGTA TCATTGCAGC ACTGGGGCCA 3850 

o 0 

GATGGTAAGC CCTCCCGTAT CGTAGTTATC TACACGACGG GGAGTCAGGC 3900 

3 5 AACTATGGAT GAACGAAATA GACAGATCGC TGAGATAGGT GCCTCACTGA 3950 

TTAAGCATTG GTAACTGTCA GACCAAGTTT ACTCATATAT ACTTTAGATT 4000 

40 

GATTTAAAAC TTCATTTTTA ATTTAAAAGG ATCTAGGTGA AGATCCTTTT 4050 
TGATAATCTC ATGACCAAAA TCCCTTAACG TGAGTTTTCG TTCCACTGAG 4100 
CGTCAGACCC CGTAGAAAAG ATCAAAGGAT CTTCTTGAGA TCCTTTTTTT 4150 
5 0 CTGCGCGTAA TCTGCTGCTT GCAAACAAAA AAACCACCGC TACCAGCGGT 4200 
GGTTTQTTTG CCGGATCAAG AGCTACCAAC TCTTTTTCCG AAGGTAACTG 4250 

55 

GCTTCAGCAG AGCGCAGATA CCAAATACTG TCCTTCTAGT GTAGCCGTAG 4300 
6Q TTAGGCCACC ACTTCAAGAA CTCTGTAGCA CCGCCTACAT ACCTCGCTCT 4350 
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GCTAATCCTG TTACCAGTGG CTGCTGCCAG TGGCGATAAG TCGTGTCTTA 4400 
^ CCGGGTTGGA CTCAAGACGA TAGTTACCGG ATAAGGCGCA GCGGTCGGGC 4450 
TGAACGGGGG GTTCGTGCAC ACAGCCCAGC TTGGAGCGAA CGACCTACAC 4500 
1 0 CGAACTGAGA TACCTACAGC GTGAGCATTG AGAAAGCGCC ACGCTTCCCG 4550 
AAGGGAGAAA GGCGGACAGG TATCCGGTAA GCGGCAGGGT CGGAACAGGA 4600 

1 5 

GAGCGCACGA GGGAGCTTCC AGGGGGAAAC GCCTGGTATC TTTATAGTCC 4650 
2Q TGTCGGGTTT CGCCACCTCT GACTTGAGCG TCGATTTTTG TGATGCTCGT 4700 
CAGGGGGGCG GAGCCTATGG AAAAACGCCA GCAACGCGGC CTTTTTACGG 4750 

2 5 TTCCTGGCCT TTTGCTGGCC TTTTGCTCAC ATGTTCTTTC CTGCGTTATC 4800 

CCCTGATTCT GTGGATAACC GTATTACCGC CTTTGAGTGA GCTGATACCG 4850 

30 

CTCGCCGCAG CCGAACGACC GAGCGCAGCG AGTCAGTGAG CGAGGAAGCG 4900 
35 GAAGAGCGCC CAATACGCAA ACCGCCTCTC CCCGCGCG1T GGCCGATTCA 4950 
TTAATCCAGC TGGCACGACA GGTTTCCCGA CTGGAAAGCG GGCAGTGAGC 5000 

4 0 GCAACGCAAT TAATG TGAGT TACCTCACTC ATTAGGCACC CCAGGCTTTA 5050 

CACTTTATGC TTCCGGCTCG TATGTTGTGT GGAATTGTGA GCGGATAACA 5100 

45 

ATTTCACACA GGAAACAGCT ATGACCATGA TTACGAATTA A 5141 

5 0 (2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: nucleic acid 
55 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:23: 

60 

ATGTCNCCNG CNCCNCCNGC N 21 
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5 (2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS • 

(A) LENGTH: 21 bases 

|B) TYPE; nucleic acid 
10 n STRA ™EDNESS: single 

10 (D) TOPOLOGY: linear 

<*i) SEQUENCE DESCRIPTION : SEQ ID NO: 24: 
15 ATGTCTCCAG CGCCGCCAGC G 21 

2q (2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS ■ 
(A) LENGTH: 21 bases 
(BJ TYPE: nucleic acid 
9 - ( c > STRANDEDNESS: single 

" (D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 
3 0 ATGTCGCCTG CTCCACCTGC T 21 



35 



(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS ■ 

(A) LENGTH: 21 bases 

(B) TYPE: nucleic acid 
An < c > STRANDEDNESS: single 
4U (D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 
4 5 ATGTCGCCAG CGCCACCAGC C 21 



50 



(2) INFORMATION FOR SEQ ID NO: 27: 

(ij SEQUENCE CHARACTERISTICS • 

(A) LENGTH: 21 bases 

(B) TYPE: nucleic acid 
cc <C) STRANDEDNESS: single 
55 (D) TOPOLOGY: linear 

fxi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 
6 0 ATGTCCCCAG CCCCACCCGC A 21 
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1 0 



1 5 



20 



25 



30 



40 



45 



55 



60 



(2) INFORMATION FOR SEQ ID NO: 28; 

fi) SEQUENCE CHARACTERISTICS- 
IAr LENGTH: 21 bases ~ 
(B > TYPE: nucleic acid 
<C> STRANDEDNESS : single 
<D> TOPOLOGY: linear 

fxi) SEQUENCE DESCRIPTION: SEQ ID NO:28: 
ATGTCGCCAG CGCCGCCAGC G 21 

<2} INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS * 
(A) LENGTH: 25 amino acids 
(3) TYPE: amino acid 
<D) TOPOLOGY: linear 

(xil SEQUENCE DESCRIPTION: SEQ ID NO: 29: 

Ser Pro Ala Pro Pro Ala Cys Asp Pro Arg Leu Leu Asn Lys Leu 



io 15 



Leu Arg Asp Asp His Val Leu His Gly Arg 



20 25 



(2) INFORMATION FOR SEQ ID NO: 30: 

3 5 (i) SEQUENCE CHARACTERISTICS * 

(A) LENGTH: 26 amino acids 
<B) TYPE: amino acid 
(D) TOPOLOGY: linear 



fxi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 
Ser Pro Ala Pro Pro Ala Cys Asp Pro Arg Leu Leu Asn Lys Leu 



io 15 



Leu Arg Asp Asp Xaa Val Leu His Gly Arg Leu 
20 25 26 

(2) INFORMATION FOR SEQ ID NO: 31: 

50 (i) SEQUENCE CHARACTERISTICS * 

(A) LENGTH: 25 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31; 
Ser Pro Ala Pro Pro Ala Xaa Asp Pro Arg Leu Leu Asn Lys Leu 



5 10 15 



Leu Arg Asp Asp His Val Leu His Gly Arg 



20 25 
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(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS- 
5 (A) LENGTH: 14 am.no acids 

fB)- TYPE: amino acid 
(D) TOPOLOGY: linear 

io (xi) SEQUENCE DESCRIPTION: SEQ ID NO:32: 

Xaa Pro Ala Pro Pro Ala Xaa Asp Pro Arg Leu Xaa Asn Lys 
(2) INFORMATION FOR SEQ ID NO: 33: 



1 5 



25 



30 



U) SEQUENCE CHARACTERISTICS* 
(A) LENGTH : 9 amino acids 
<B) TYPE: amino acid 
20 < D ) TOPOLOGY: linear 

fxi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 

Pro Arg Leu Leu Asn Lys Leu Leu Arg 
1 5 9 

(2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS ■ 
<A> LENGTH: 45 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
<D> TOPOLOGY: linear 

35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:34: 

GCCGTGAAGG ACGTGGTCGT CACGAAGCAG TTTATTTAGG AGTCG 45 

40 

(2) INFORMATION FOR SEQ ID NO: 35: 

(i) SEQUENCE CHARACTERISTICS * 
(A) LENGTH: 20 bases 
45 <B) TYPE: nucleic acid 

<C) STRANDEDNESS: single 
<D) TOPOLOGY: linear 

so (xi) SEQUENCE DESCRIPTION: SEQ ID NO:35: 

CCNGCNCCNC CNGCNTGYGA 20 



(2) INFORMATION FOR SEQ ID NO:36: 

(i) SEQUENCE CHARACTERISTICS • 

(A) LENGTH: 21 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
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(D) TOPOLOGY: linear 
(xi, SEQUENCE DESCRIPTION: SEQ ID NO:36: 

5 

NCCRTGNAKN ACRTGRTCRT C 21 

10 12) INFORMATION FOR SEQ ID NO:37: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 69 bases 
1S < B) TYPE : nucleic acid 

(C, STRANDEDNESS : single 
<D) TOPOLOGY: linear 



20 



25 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:37 : 
CCAGCGCCGC CAGCCTGTGA CCCCCGACTC CTAAATAAAC TGCCTCGTGA 50 
TGACCACGTT CAGCACGGC 69 



(2) INFORMATION FOR SEQ ID NO: 38: 

30 U) SEQUENCE CHARACTERISTICS- 

(A) LENGTH: 69 bases 

<B) TYPE: nucleic acid 

fC) STRANDEDNESS: single 
35 < D > TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:38: 
4o GGTCGCGGCG GTCGGACACT GGGGGCTGAG GATTTATTTG ACGGAGCACT 50 



ACTGGTGCAA GTCGTGCCG 69 



45 



(2) INFORMATION FOR SEQ ID NO: 39: 

(i) SEQUENCE CHARACTERISTICS * 
(A) LENGTH: 69 bases 
=>° (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(DJ TOPOLOGY: linear 

5s (xi) SEQUENCE DESCRIPTION : SEQ ID NO:39: 

CCAGCACCTC CGGCATGTGA CCCCCGACTC CTAAATAAAC TGCITCG^A 50 
6 0 CGACCACGTC CATCACGGC 69 
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(2) INFORMATION FOR SEQ ID NO: 40: 

(i) SEQUENCE CHARACTERISTICS- 
{ A ) " LENGTH : 69 basef 
{*> TYPE: nucleic acid 
C STRANDEDNESS: single 
APOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:40: 
1 5 GGTCGTGGAG GCCGTACACT GGGGGCTGAG GATTTATTTG ACGAAGCACT 50 
GCTGGTGCAG GTAGTGCCG 69 



1 0 



20 



(2) INFORMATION FOR SEQ ID NO: 41; 

(i) SEQUENCE CHARACTERISTICS * 
? r fA > LENGTH : 69 bases 

" ( fi > WE: nucleic acid 

fO STRANDEDNESS: single 
<D) TOPOLOGY: linear 

3o (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41: 

CCAGCACCGC CGGCATGTGA CCCCCGACTC CTAAATAAAC TGCTTCGTGA 50 
3 5 CGATCATGTC TATCACGGT 69 



(2) INFORMATION FOR SEQ ID NO:42: 

(i) SEQUENCE CHARACTERISTICS • 

(A) LENGTH: 69 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
45 (D) TOPOLOGY : linear 



50 



55 



(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 42: 
GGTCGTGGCG GCCGTACACT GGGGGCTGAG GATTTATTTG ACGAAGCACT 50 
GCTAGTACAG ATAGTGCCA 69 

(2) INFORMATION FOR SEQ ID NO: 43: 



(l) SEQUENCE CHARACTERISTICS- 
60 (A) LENGTH: 37 bases 

(B) TYPE: nucleic acid 
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1 0 



(O STRANDEDNESS; single 
<D) TOPOLOGY: linear 

(xi, SEQUENCE DESCH:pt I0N: Seq „ 
GCTAGCTCTA GAAATTGCTC CTCGTGGTCA TGCTTCT 37 

(2) INFORMATION FOR SEQ ID NO: 44: 

fi) SEQUENCE CHARACTERISTICS • 
i e A) ^NGTH: 22 bases 

15 B) TYPE : nucleic acid 

(CJ STRANDEDNESS: single 

(D) TOPOLOGY: linear 

2Q (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44: 

CAGTCTGCCG TGAAGGACAT GG 22 

25 

(2) INFORMATION FOR SEQ ID NO: 45: 

(i) SEQUENCE CHARACTERISTICS • 
« n < A > LENGTH: 24 bases 

JU TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

3s (xi) SEQUENCE DESCRIPTION: SEQ ID NO:45: 

TGTGGACTTT AGCTTGGGAG AATG 24 



40 



(2) INFORMATION FOR SEQ ID NO: 46: 

(i) SEQUENCE CHARACTERISTICS • 
ac (A) LENGTH: 22 bases 

45 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

5Q (xi) SEQUENCE DESCRIPTION : SEQ ID NO: 46: 

GGTCCAGGGA CCTGGAGGTT TG 22 



55 



60 



(2) INFORMATION FOR SEQ ID NO:47 : 

<i> SEQUENCE CHARACTERISTICS • 

(A) LENGTH: 31 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
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(D) TOPOLOGY; linear 
(xi) SEQU^CE DESCRIPTION: SEQ 1D NO:47; 

ATCGATATCG ATAGCCAGAC ACCCCGGCCA G 31 
C2) INFORMATION FOR SEQ ID NO: 48: 



<U SEQUENCE CHARACTERISTICS- 
LENGTH: 36 bases 
15 ~? WPE: ™cleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



20 
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SEQUENCE DESCRIPTION: SEQ ID NO:48: 
GCTAGCTCTA GACAGGGAAG GGAGCTGTAC ATGAGA 36 

(2} INFORMATION FOR SEQ ID NO: 49; 

(i) SEQUENCE CHARACTERISTICS • 

(A) LENGTH: 24 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 49: 
CTCCTTGGAA CCCAGGGCAG GACC 24 

(2) INFORMATION FOR SEQ ID NO: 50: 



(D SEQUENCE CHARACTERISTICS * 
(A) LENGTH: 24 bases 
d c fB i ^PE: nucleic acid 

fC > STRANDEDNESS : single 
(D) TOPOLOGY: linear 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NOrSO: 
GGTCCTGCCC TGGGTTCCAA GGAG 24 



5 5 (2) INFORMATION FOR SEQ ID NO: 51: 

(i) SEQUENCE CHARACTERISTICS * 
(A) LENGTH: 27 bases 
6n (B) TYPE: nucleic acid 

OU (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID N0 :5l: 
5 CTGCTCCGAG GAAAGGACTT CTGGATT 27 

10 INFORMATI0 ^ORSEQ IDNO: 52: 

(i) SEQUENCE CHARACTERISTICS: 
A LENGTH: 27 bases 
B TYPE: nucleic acid 
15 n S TRANDEDNE SS : single 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:52: 
2 0 AATCCAGAAG TCCTTTCCTC GG AG C AG 27 

25 (2) INFORMATION FOR SEQ ID NO: 53: 

(i) SEQUENCE CHARACTERISTICS • 

(A) LENGTH: 28 bases 

(B) TYPE: nucleic acid 
<C> STRANDEDNESS: single 

JU fD) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:5 3: 
3 5 CCCTCTGCGT CGCGGCGGCC CCACCCAC 28 

4o (2) INFORMATION FOR SEQ ID NO:54: 

(i) SEQUENCE CHARACTERISTICS • 

(A) LENGTH: 28 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
* b f D > TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 54: 
GTGGGTGGGG CCGCCGCGAC GCAGAGGG 28 



50 
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(2) INFORMATION FOR SEQ ID NO:55 : 

(i) SEQUENCE CHARACTERISTICS * 

(A) LENGTH: 35 bases 

(B) TYPE: nucleic acid 
fin (C) STRANDEDNESS : single 
60 (D) TOPOLOGY: linear 



-187- 



WO 95/18858 



PCT/US94/I4553 



(xi) SEQUENCE DESCRIPTION: SEQ ID K 
GACTCGAGGA TCCATCGATT TTTTTTTTTT TTT 



(2) INFORMATION FOR SEQ ID NO:56: 

(i) SEQUENCE CHARACTERISTICS- 

(A) LENGTH: 17 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

<*i) SEQUENCE DESCRIPTION: SEQ ID 
GACTCGAGGA TCCATCG 17 



(2) INFORMATION FOR SEQ ID NO: 57: 

<i) SEQUENCE CHARACTERISTICS- 

(A) LENGTH: 32 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION; SEQ ID 



GCTAGCTCTA GAAGCCCGGC TCCTCCTGCC 



(2) INFORMATION FOR SEQ ID NO: 58: 

(i) SEQUENCE CHARACTERISTICS * 
(A) LENGTH: 21 bases 
(BJ TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID 



CGAAATTAAC CCTCACTAAA G 21 



(2) INFORMATION FOR SEQ ID NO: 59: 

(i) SEQUENCE CHARACTERISTICS- 
(A) LENGTH: 103 bases 
(BJ TYPE: nucleic acid 
(C) STRANDEDNESS: single 
<D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION : SEQ ID 
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TGCAGCAAGG GCTACTGCCA CACTCrir~r ^ 

^ACTCGAGcT GCGCAGATGC TAGCCTCAAG 50 

ATGGCTGATC CAAATCGAT^ CCCCrKr*** 

L CCGCGGCAAA GATCTTCCGG TCCTGTAGAA 100 



GCT 103 



(2) INFORMATION FOR SEQ ID NO: 60: 

15 (i) SEQUENCE CHARACTERISTICS • 

(A) LENGTH: 103 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
2Q (DJ TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 60: 
^ AGCTTCTACA GGACCGGAAG ATCTTTGCCG CGGAATCGAT TTGGATCAGC 50 

CATCTTGAGG CTAGCATCTG CGCAGCTCGA GTGTGGCAGT AGCCCTTGCT 100 
3 0 GCA 103 



(2) INFORMATION FOR SEQ ID NO: 61: 

(i) SEQUENCE CHARACTERISTICS • 

(A) LENGTH: 18 bases 

<B) TYPE: nucleic acid 
4 0 IS! STRANDED NESS: single 

40 <D> TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 61: 
4 5 TCTCGCTACC GTTTACAG 18 



50 



(2) INFORMATION FOR SEQ ID NO: 62: 

(i) SEQUENCE CHARACTERISTICS * 

(A) LENGTH: 25 bases 

(B) TYPE: nucleic acid 
cc (C) STRANDEDNESS : single 

05 <D> TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 62: 

6 0 CAGGTACCCA CCAGGCGGTC TCGGT 25 
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(2) INFORMATION FOR SEQ ID NO: 63: 

5 (i) SEQUENCE CHARACTERISTICS : 

A LENGTH: 13 bases 
3 TYPE: nucleic acid 
(C) STRANDEDNESS : single 
10 (D} T °POLOGY: l inear 

<*i> SEQUENCE DESCRIPTION: SEQ ID NO: 63: 
1 5 GGGCCATGAC ACTGTCAA 18 



20 



(2) INFORMATION FOR SEQ ID NO: 64: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
25 (D) TOPOLOGY : linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 64: 
3q GACCGCCACC GAGACCGCCT GGTGGGTACC TGTGGTCCTT 40 

(2) INFORMATION FOR SEQ ID NO: 65: 

3 5 <i) SEQUENCE CHARACTERISTICS • 

(A) LENGTH: 32 bases 

(B) TYPE: nucleic acid 
fC) STRANDEDNESS: single 

4Q < D > TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 65: 
45 ATCGATATCG ATCAGCCAGA CACCCCGGCC AG 32 



50 



(2) INFORMATION FOR SEQ ID NO: 66: 

(i) SEQUENCE CHARACTERISTICS • 

(A) LENGTH: 35 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
55 < D > TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 66: 
60 TCTAGATCTA GATCACCTGA CGCAGAGGGT GGACC 35 
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(2) INFORMATION FOR SEQ ID MO:67 : 

(i) SEQUENCE CHARACTERISTICS • 
(A) LENGTH: 33 bases 
* WE: nucleic acid 
C STRANDEDNESS : single 
<D> TOPOLOGY: linear 

(XI) SEQUENCE DESCRIPTION : SEQ 1D NO:67: 
AGTCGACGTC GACGTCGGCA GTGTCTGAGA ACC 33 

(2) INFORMATION FOR SEQ ID NO:6 8: 

(i) SEQUENCE CHARACTERISTICS: 
,A > LENGTH: 36 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:68: 
AGTCGACGTC GACTCACCTG ACGCAGAGGG TGGACC 36 



25 



30 



(2) INFORMATION FOR SEQ t d N0:6 g. 

(i) SEQUENCE CHARACTERISTICS ■ 
35 (A) LENGTH : 62 bases 



(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
<D) TOPOLOGY: linear 



40 
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(xi) SEQUENCE DESCRIPTION : SEQ ID NO:69: 

CGCGTATGCC AGCCCGGCTC C<rrrn*nr*™~ 

WJGCTC CTCCTGCTTG TGACCTCCGA GTCCTCAGTA 50 



AACTGCTTCG TG 62 



50 



(2) INFORMATION FOR SEQ ID NO:70: 

(i) SEQUENCE CHARACTERISTICS • 
(A) LENGTH: 61 bases 
5 5 < B) WPE: nucleic acid 

55 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:7.0: 



60 



ATACGGTCGG GCCGAGGAGG ACGAACACTG GAGCC*** ACtt^ 50 
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CGAAGCACTG A 61 

5 

(2) INFORMATION FOR SEQ ID NO:7l : 

10 U) ff?^ CE ^CHARACTERISTICS- 

° A LENGTH: 37 bases 

B TYPE: nucleic acid 

D T S n T " ES S: single 
(D) TOPOLOGY: linear 

«5 (xi) SEQUENCE DESCRIPTION: SEQ tD N 0:71: 

CTAGAATTAT GAAAAAGAAT ATCGCATTTC TTCTTAA 37 



20 



(2) INFORMATION FOR SEQ ID NO: 72: 

(i) SEQUENCE CHARACTERISTICS • 
" <A) LENGTH: 37 bases 

<B) TYPE: nucleic acid 
CO STRAND EDNESS : single 
(D) TOPOLOGY: linear 

3 0 <xi> SEQUENCE DESCRIPTION : SEQ ID NO: 72: 

TTAATACTTT TTCTTATAGC GTAAAGAAGA ATTGCGC 37 

35 

(2) INFORMATION FOR SEQ ID NO: 73: 

(i> SEQUENCE CHARACTERISTICS- 
4U (A) LENGTH: 69 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: li near 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:7 3: 



45 



50 



CTAGAATTAT GAAAAAGAAT ATCGCATTTC ATCACCATCA CCATCACCAT SO 
CACATCGAAG GTCGTAGCC 69 



5 5 (2) INFORMATION FOR SEQ JD NO:74: 

(i> SEQUENCE CHARACTERISTICS ■ 

(A) LENGTH: 62 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



60 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:74: 
TTAATACTTT TTCTTATAOC GTAAAGTAGT GGTAGTGGTA GTGGTAGTGT 50 



55 
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AGCTTCCAGC AT 62 



(2) INFORMATION FOR SEQ ID NO: 75: 

fi) SEQUENCE CHARACTERISTICS * 

(A) LENGTH: 69 bases 

(B) TYPE: nucleic acid 
<C) STRAND EDNESS : single 
<D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 75: 
CTAGAATTAT GAAAAAGAAT ATCGCATTTC ATCACCATCA CCATCACCAT 50 
CACATCGAAC CACGTAGCC 69 



3 0 (2) INFORMATION FOR SEQ ID NO: 76: 

(i> SEQUENCE CHARACTERISTICS* 

(A) LENGTH: 62 bases 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 76: 

TTAATACTTT TTCTTATAGC GTAAAGTAGT GGTAGTGGTA GTGGTAGTGT 50 
AGCTTGGTGC AT 62 

(2) INFORMATION FOR SEQ ID NO: 77: 

5 0 (i) SEQUENCE CHARACTERISTICS* 

(A) LENGTH: 19 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 77; 
TCCACCCTCT GCGTCAGGT 19 
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(2) INFORMATION FOR $EQ ID NO:7 8: 

(i) SEQUENCE CHARACTERISTICS. 
(A) LENGTH : 18 bases 
JB) TYPE: nucleic acid 

(C) STRANDEDNESS: s<n^e 

(D) TOPOLOGY: lineal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:78: 
GGAGACGCAG TCCATCGA 18 

(2) INFORMATION FOR SEQ ID NO:79 : 



(i) SEQUENCE CHARACTERISTICS ■ 

(A) LENGTH : 62 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



25 
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(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:79: 
GCAGCAGTTC TAGAATTATG TCNCCNGCNC CNCCNGCNTG TGACCTCCGA 50 



GTTCTCAGTA AA 62 



3 5 (2 J INFORMATION FOR SEQ ID NO: 80: 

U) SEQUENCE CHARACTERISTICS * 
(A) LENGTH: 49 bases 
40 JB) TYPE: nucleic acid 

* U < c > STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 80: 
ACACTGGAGG CTCAAGAGTC ATTTGACGAA GCACTGAGGG TACAGGAAG 49 



50 (2) INFORMATION FOR SEQ ID NO:81: 

(i) SEQUENCE CHARACTERISTICS • 
<A> LENGTH: 45 bases 
cc (B) ^PE: nucleic acid 

0:3 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 81; 
CTAGAATTAT GAAAAAGAAT ATCGCATTTA TCGAAGGTCG TAGCC 45 
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(2) INFORMATION FOR SEQ ID NO: 82: 

(i) SEQUENCE CHARACTERISTICS ■ 
(A) LENGTH: 3 8 bases 
JB) TYPE: nucleic acid 
C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 



<*i> SEQUENCE DESCRIPTION: SEQ ID NO: 82: 
1 5 TTAATACTTT TTCTTATAGC GTAAATAGCT TCCAGCAT 38 

2q (2) INFORMATION FOR SEQ ID NO: 83: 

(i) SEQUENCE CHARACTERISTICS- 

(A) LENGTH: 4 5 bases 

(B) TYPE: nucleic acid 
< c > STRANDEDNESS: single 

" (D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 83: 
3 0 CTAGAATTAT GAAAAAGAAT ATCGCATTTC TTCTTAAACG TAGCC 4 5 



(2) INFORMATION FOR SEQ ID NO: 84: 

(i) SEQUENCE CHARACTERISTICS ■ 

(A} LENGTH: 3 8 bases 

(B) TYPE: nucleic acid 
An (c > STRANDEDNESS: single 

40 (D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:84: 
4 5 TTAATACTTT TTCTTATAGC GTAAAGAAGA ATTTGCAT 38 



(2) INFORMATION FOR SEQ ID NO: 85: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 85: 

Met Lys Lys Asn lie Ala Phe Leu Leu Asn Ala Tyr Ala Ser Pro 
60 5 10 15 
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Ala Pro Pro Ala Cys Cys V al Arc Arg Ala 
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5 (2) INFORMATION FOR SEQ ID NO : 8 6 : 
(i) SEQUENCE CHARACTERISTICS ■ 

B TV?E™ : 31 «^ 
1Q TOPOLOGY: l inear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO.-86: 
«et Lys Lys As„ lie Ala . Phe His His His His ^ ^ ^ 
15 3 10 15 

116 S ? 0 P " Ala P - Ala Cys Cys Val Arg Ax 3 



15 

Cys Cys Val Arg A 

25 30 
Ala 

20 3i 
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(2) INFORMATION FOR SEQ ID NO: 87: 

(i) SEQUENCE CHARACTERISTICS • 
(A) LENGTH : 31 anino acids 
(B> TYPE: amino acid 
<D> TOPOLOGY: linear 



30 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 87: 
Met Lys Lys Asn lie Ala Phe His His HI. His His His His His 
35 He Glu Pro Ar g Ser Pro Ala Pro Pro Ala Cys Cys Val Ar g £ 



Ala 

31 



25 30 



4 0 (2) INFORMATION FOR SEQ ID NO: 88: 

(i) SEQUENCE CHARACTERISTICS * 
(A) LENGTH: 7 amino acids 
4C < B > TYPE: amino acid 

4& (D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 88: 

Thr Thr Ala His Lys Asp Pro 
50 1 5 7 

(2) INFORMATION FOR SEQ ID NO: 89: 

( i ) SEQUENCE CHARACTERISTICS • 
(Ai LENGTH: 4 amino acids 
(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

eo (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 89: 
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1 4 



5 12 » FORMATION FOR SEQ : D NO:90; 

(i) SEQUENCE CHARACTERISTICS: 

J Se TH: 4 amir -° acids 

o XTPE: ammo acid 
10 (D) TOp OLOGYr linear 

SEQUENCE DESCRIPTION: SEQ ID NO: 90: 



1 5 



Ser Arg Leu Ser 
1 4 
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(2) INFORMATION FOR SEQ ID NO: 91: 

<i) SEQUENCE CHARACTERISTICS • 
? n (A) LENGTH: 4 amino acids 

C " <B> TYPE: amino acid 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:9l : 
2 5 ser His Val Leu 
1 4 

(2) INFORMATION FOR SEQ ID NO: 92: 

(i) SEQUENCE CHARACTERISTICS - 

(A) LENGTH: 4 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 92; 
His Ser Arg Leu 
1 4 

4 0 (2) INFORMATION FOR SEQ ID NO; 93: 

<i> SEQUENCE CHARACTERISTICS • 
(A) LENGTH: 4 amino acids 
A c (B) 7r?E: amino acid 

° <D> TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO;93: 

Ala Val Asp Phe 
50 i 4 

(2) INFORMATION FOR SEQ ID NO: 94: 

(i) SEQUENCE CHARACTERISTICS • 
35 (A) LENGTH : 4 amino acids 

fB> TYPE: amino acid 
(D) TOPOLOGY: linear 

6Q (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 94: 
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Ser Leu Gly Glu 
1 4 

[2) INFORMATION FOR SEQ ID NO: 95; 

(i) SEQUENCE CHARACTERISTICS ■ 
A) LENGTH: 4 amino acids 
(B) TYPE: amino acid 
<I» TOPOLOGY: linear 

<xi) SEQUENCE DESCRIPTION : SEQ ID NO: 95: 
Ala Val Thr Leu 



1 
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(2) INFORMATION FOR SEQ ID NO: 96: 

(i) SEQUENCE CHARACTERISTICS: 
9ft (A) LENGTH: 4 amino acids 

^ U (B) TYPE: amino acid 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:96: 

2 5 Leu Leu Glu Gly 

1 4 

(2) INFORMATION FOR SEQ ID NO: 97: 

3 0 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 ajnino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

3 5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 97: 

Leu Ser Ser Leu 
1 4 

4 0 (2) INFORMATION FOR SEQ ID NO: 98: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 4 amino acids 

(B) TYPE: amino acid 
45 <D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 98: 

Leu Gly Gin Leu 
50 i 4 

(2) INFORMATION FOR SEQ ID NO: 99: 

(i) SEQUENCE CHARACTERISTICS* 
55 (A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 99: 
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Cys Xaa Leu Ser Ser 
1 5 



g (2) INFORMATION FOR SEQ ID NO: 100: 

(i) SEQUENCE CHARACTERISTICS • 
LENGTH: 4 amino acids 
(B) TYPE: ammo acid 
10 {D) APOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:100: 



1 5 



Leu Leu Gly Gin 
1 4 



(2) INFORMATION FOR SEQ ID NO: 101: 

(i) SEQUENCE CHARACTERISTICS * 
on (A > LENGTH: 4 amino acids 

£V (B) TYPE: amino acid 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 101: 

2 5 Ser Ser Leu Leu 

1 4 

(2) INFORMATION FOR SEQ ID NO: 102: 

3 0 (i) SEQUENCE CHARACTERISTICS ■ 

(A) LENGTH: 4 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

3 5 ( Xi ) SEQUENCE DESCRIPTION: SEQ ID NO: 102: 

Gly Gin Leu Ser 
1 4 

4 0 (2) INFORMATION FOR SEQ ID NO: 103: 

(i) SEQUENCE CHARACTERISTICS • 
(A) LENGTH: 4 amino acids 
. (B) TYPE: amino acid 

45 (D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 103: 

Cys Leu Ser Ser 
50 i 4 

(2) INFORMATION FOR SEQ ID NO: 104: 

(i) SEQUENCE CHARACTERISTICS • 
55 < A > LENGTH : 4 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

6q (Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 104 : 
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Leu Gin Ser Leu 
1 4 

(2) INFORMATION FOR SEQ ID NO: 105: 

(i) SEQUENCE CHARACTERISTICS- 

(A) LENGTH: 4 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 105: 
Leu Gly Thr Gin 



1 



4 



(2) INFORMATION FOR SEQ ID NO: 106: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 4 amino acids 
20 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 106: 

2 5 Ala Leu Gin Ser 

1 4 

(2) INFORMATION FOR SEQ ID NO: 107: 

3 0 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

3 5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 107: 

Leu Leu Gly Thr 
1 4 

4 0 (2) INFORMATION FOR SEQ ID NO: 108: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 amino acids 

(B) TYPE: amino acid 
45 (D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:10B: 

Asn Ala lie Phe 
50 1 4 

(2) INFORMATION FOR SEQ ID NO: 109 : 

<i) SEQUENCE CHARACTERISTICS: 
55 (A) LENGTH: 4 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 109 : 
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Leu Ser Phe Gin 
1 4 

(2) INFORMATION FOR SEQ ID NO: 110: 

<i) SEQUENCE CHARACTERISTICS • 
(A) LENGTH: 22 amino acids 
(Bi TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 110: 
Ser Pro Ala Pro Pro Ala Cys Asp Leu Arg Val Leu Ser Lys Leu 
15 5 10 15 

Leu Arg Asp Ser His Val Leu 
20 22 



1 0 



20 



25 



(2) INFORMATION FOR SEQ ID NO: 111; 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO:lll: 

His Ser Arg Leu Ser Gin Cys Pro Glu Val His Pro Leu Pro Thr 
30 5 10 15 

Pro Val Leu Leu Pro Ala Val Asp Phe 
20 24 

<2) INFORMATION FOR SEQ ID NO: 112: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 112: 

Ser Leu Gly Glu Trp Lys Thr Gin Met Glu Glu Thr Lys Ala Gin 
45 5 10 15 

Asp He Leu Gly Ala Val Thr Leu 
20 23 



35 



40 



50 



55 



(2) INFORMATION FOR SEQ ID NO: 113: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 113: 

Leu Leu Glu Gly Val Met Ala Ala Arg Gly Gin Leu Gly Pro Thr 
60 5 10 15 
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Cys Leu Ser Ser Leu Leu 

20 21 

(2) INFORMATION FOR SEQ ID HO: 114 : 

(i) SEQUENCE CHARACTERISTICS- 

(A) LENGTH: 16 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

Cxi) SEQUENCE DESCRIPTION: SEQ ID NO: 114: 

Gly Gin Leu Ser Gly Gin Val Arg Leu Leu Leu Gly Ala Leu Gin 
15 5 10 15 

Ser 
16 



1 0 



20 



25 



(2) INFORMATION FOR SEQ ID NO: 115: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 22 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION : SEQ ID NO: 115 : 
Leu Leu Gly Thr Gin Leu Pro Pro Gin Gly Arg Thr Thr Ala His 
30 5 10 « 

Lys Asp Pro Asn Ala He Phe 
20 22 

(2) INFORMATION FOR SEQ ID NO: 116: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 5 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 116: 
Leu Ser Phe Gin His Leu Leu Arg Gly Lys Val Arg Phe Leu Met 
45 5 10 15 

Leu Val Gly Gly Ser Thr Leu Cys Val Arg 
20 25 



35 



40 



50 



55 



60 



(2) INFORMATION FOR SEQ ID NO: 117: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY : linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 117: 

Met Pro Pro Ala 
1 4 
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(2) INFORMATION FOR SEQ ID NO: 118: 

(i) SEQUENCE CHARACTERISTICS- 
(A) LENGTH: 5 amino ac^ds 
5 (B) TYPE: amino acid 

(Dh- TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:118: 

1 0 Met Ala Pro Pro Ala 
1 5 

(2) INFORMATION FOR SEQ ID NO: 119: 

15 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

2 0 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 119: 

Met Pro Ala Pro Pro Ala 
1 5 6 

2 5 (2) INFORMATION FOR SEQ ID NO: 120: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 
30 (D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 120: 

Met Ser Pro Ala Pro Pro Ala 
35 i 5 7 



40 



45 



50 



55 



60 



(2) INFORMATION FOR SEQ ID NO: 121 : 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 4 amino acids 

(B) TYPE: amino acid 
<D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID MO: 121: 

Ala Pro Pro Ala 
1 4 

(2) INFORMATION FOR SEQ ID NO: 122: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 122: 

Pro Ala Pro Pro Ala 
1 5 
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(2) INFORMATION FOR SEQ ID NO: 123: 

(i) SEQUENCE CHARACTERISTICS- 
(A) LENGTH: 6 amino acids 
5 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION; SEQ ID NO: 123: 

1 0 Ser Pro Ala Pro Pro Ala 
1 5 6 

(2) INFORMATION FOR SEQ ID NO: 124: 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 4 amino acids 

(B) TYPE : amino acid 
<D) TOPOLOGY : linear 

2 0 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 124: 

Val Arg Arg Ala 
1 4 

2 5 (2) INFORMATION FOR SEQ ID NO: 125: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 
(3) TYPE: amino acid 

3 0 (D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 125: 

Val Arg Arg Ala Pro 
35 1 5 

(2) INFORMATION FOR SEQ ID NO: 12 6: 

(i) SEQUENCE CHARACTERISTICS: 
4 0 (A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



45 



50 



55 



60 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:126: 

Val Arg Arg Ala Pro Pro 
1 5 6 

(2) INFORMATION FOR SEQ ID NO: 127 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 127: 

Val Arg Arg Ala Pro Pro Thr 

1 5 7 
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1 0 



(2) INFORMATION FOR SEQ ID NO; 128: 

(i) SEQUENCE CHARACTERISTICS- 
(A) LENGTH: 8 ammo acids 
<B> TYPE: amino acid 
(Dh TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 128: 

Val Arg Arg Ala Pro Pro Thr Thr 
1 5 8 

(2) INFORMATION FOR SEQ ID NO: 129: 

15 ( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 9 amino acids 

(B) TYPE : amino acid 
(D) TOPOLOGY: linear 

2 0 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 129: 

Val Arg Arg Ala Pro Pro Thr Thr Ala 
1 5 9 

2 5 (2) INFORMATION FOR SEQ ID NO: 13 0: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 10 amino acids 
<B) TYPE: amino acid 

3 0 (D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 130 : 

Val Arg Arg Ala Pro Pro Thr Thr Ala Val 
35 1 5 10 

(2) INFORMATION FOR SEQ ID NO: 131: 

(i) SEQUENCE CHARACTERISTICS: 
40 (A) LENGTH: 11 amino acids 

{B> TYPE: amino acid 
(D) TOPOLOGY: linear 



45 



50 



55 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 131: 

Val Arg Arg Ala Pro Pro Thr Thr Ala Val Pro 
1 5 10 11 

(2) INFORMATION FOR SEQ ID NO: 132: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 132: 

Val Arg Arg Ala Pro Pro Thr Thr Ala Val Pro Ser 
1 5 10 12 
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1 0 



(2) INFORMATION FOR SEQ ID NO: 133: 

<x> SEQUENCE CHARACTERISTICS- 

!o! Jf? GTH: 13 ara ^o acids 
(B> ?YPE: amino acid 
'D> TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 133: 

Val Arg Arg Ala Pro Pro Thr Thr Ala Val Pro Ser Arg 
5 10 13 

(2) INFORMATION FOR SEQ ID NO: 134: 

15 (i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

2 0 (Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 134: 

Val Arg Arg Ala Pro Pro Thr Thr Ala Val Pro Ser Arg Thr 
5 1° 14 

2 5 (2) INFORMATION FOR SEQ ID NO: 135: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 15 amino acids 
<B) TYPE: amino acid 
30 (D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION : SEQ IDNO:135: 

Val Arg Arg Ala Pro Pro Thr Thr Ala Val Pro Ser Arg Thr Ser 
5 10 



15 



(2) INFORMATION FOR SEQ ID NO : 13 6: 

40 (i) SEQUENCE CHARACTERISTICS * 

(A) LENGTH: 16 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

4 5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13 6: 



Val Arg Arg Ala Pro Pro Thr Thr Ala Val Pro Ser Arg Thr Ser 
5 « 15 

5 0 Leu 
16 



(2) INFORMATION FOR SEQ ID NO: 137 ; 

5 5 (i) SEQUENCE CHARACTERISTICS * 

(A) LENGTH: 17 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

6 0 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 137: 
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Val Arg Arg Ala Pro Pro Th- Thr v= i n 

1 , ir4i inr Val Pro Ser Arg Thr Ser 

5 10 15 

Leu Val 
5 17 

(2) INFORMATION FOR SEQ ID NO: 13 8: 

(i) SEQUENCE CHARACTERISTICS ■ 
10 < A > LENGTH : 18 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO:138: 

Val Arg Arg Ala Pro Pro Thr Thr Ala Val Pro Ser Arg Thr Ser 
1 5 10 15 



20 



30 



45 



50 



Leu Val Leu 
18 



(2) INFORMATION FOR SEQ ID NO: 13 9: 

(i) SEQUENCE CHARACTERISTICS: 
25 <A) LENGTH: 19 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 139: 

Val Arg Arg Ala Pro Pro Thr Thr Ala Val Pro Ser Arg Thr Ser 
15 10 15 



Leu Val Leu Thr 
35 19 

(2) INFORMATION FOR SEQ ID NO: 140: 

(i) SEQUENCE CHARACTERISTICS: 
40 (A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 140 : 

Val Arg Arg Ala Pro Pro Thr Thr Ala Val Pro Ser Arg Thr Ser 
15 10 15 

Leu Val Leu Thr Leu 
20 

(2) INFORMATION FOR SEQ ID NO: 141: 



(i> SEQUENCE CHARACTERISTICS: 
5 5 (A) LENGTH: 21 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



60 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:141: 
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Val Arg Arg Ala Pro Pro Th- tk>. s1 „ , 

i-ro Ttu Thr Ala Val Pro Ser Arg Thr Ser 

5 io 15 

Leu Val Leu Thr Leu Asn 
5 20 21 

(2) INFORMATION FOR SEQ ID NO:142: 

(i) SEQUENCE CHARACTERISTICS • 
10 < A) LENGTH: 22 amino acids 

<B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 142: 

Val Arg Arg Ala Pro Pro Thr Thr Ala Val Pro Ser Arg Thr Ser 

Leu Val Leu Thr Leu Asn Glu 
20 22 

(2) INFORMATION FOR SEQ ID NO: 143 : 

(ij SEQUENCE CHARACTERISTICS • 
25 (A) LENGTH: 23 amino acids 

<B> TYPE: amino acid 
(D) TOPOLOGY: linear 



20 



30 



35 



45 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 143: 
Val Arg Arg Ala Pro Pro Thr Thr Ala Val Pro Ser Arg Thr Ser 



5 10 15 



Leu Val Leu Thr Leu Asn Glu Leu 
20 23 

(2) INFORMATION FOR SEQ ID NO: 144: 

(i) SEQUENCE CHARACTERISTICS' 
40 <A) LENGTH: 24 amino acids 

(BJ TYPE: amino acid 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 144: 
Val Arg Arg Ala Pro Pro Thr Thr Ala Val Pro Ser Arg Thr Ser 



10 15 



Leu Val Leu Thr Leu Asn Glu Leu Pro 
20 24 
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CLAIMS 

We claim: 



1. 



5 



2. 



1 0 



1 5 



30 



3. 



An isolated substantially homogeneous mpl ligand polypeptide. 

The mpl ligand polypeptide of Claim 1 selected from the group consisting of 

(a) a fragment polypeptide; 

( b ) a variant polypeptide; and 

(c) a chimeric polypeptide. 

The mpl ligand polypeptide of Claim 1 selected from the group consisting of 

(a ) the polypeptide that is isolated from a mammal; 

( b ) the polypeptide that is made by recombinant means; and 

( c ) the polypeptide that is made by synthetic means. 

The mpl ligand polypeptide of Claim 1 selected from the group consisting of 
( a ) the polypeptide that is human; and 
{ b) the polypeptide that is non- immunogenic in a human. 

An isolated substantially homogeneous mpl agonist characterized in that: 

(a) the agonist stimulates the incorporation of labeled nucleotides (3h- 
thymidine) into the DNA of IL-3 dependent Ba/F3 cells transfected with human 
mpl P; or 

(b) the agonist stimulates 35 s incorporation into circulating platelets 
2 5 in a platelet rebound assay. 



4. 



20 5. 



6. 



A fragment polypeptide according to Claim 2 represented by 
X-hTPO(7-151)-Y 

where 

hTPO(M51) represents the human TPO (hML) amino acid sequence 
from Cys 7 through Cys 151 inclusive; 

X represents an amino group of Cys 7 or amino-terminus amino acid 
residue(s) selected from the group 

M, 
MA, 
MPA, 

MPPA, (SEQIDNO: 117) 
MAPPA, (SEQ ID NO: 118) 
-209- 
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MPAPPA, (SEQ ID NO: 119) 
MSPAPPA, (SEQ ID NO: 120) 
A. 
PA, 
PPA, 

APPA, (SEQ ID NO: 121) 
PAPPA, (SEO ID NO: 122) 
SPAPPA, (SEQ ID NO: 123) 
Y represents the carboxy terminal group of Cysisi or carboxy . 
terminus amino acid resldue(s) selected from the group 
V, 
VR, 
VRR, 

VRRA, (SEQ ID NO; 124) 
VRRAP, (SEQ ID NO: 125) 
VRRAPP, (SEQ ID NO: 126) 
VRRAPPT, (SEQ ID NO: 127) 
VRRAPPTT, (SEQ ID NO: 128) 
VRRAPPTTA, (SEQ ID NO: 129) 
VRRAPPTTAV, (SEQ ID NO: 1 30) 
VRRAPPTTAVP, (SEQ ID NO: 131) 
VRRAPPTTAVPS, (SEQ ID NO: 132) 
VRRAPPTTAVPSR. (SEQ ID NO: 133) 
VRRAPPTT AVPSRT, (SEQ ID NO: 134) 
VRRAPPTTAVPSRTS, (SEQ ID NO: 135) 
VRRAPPTTAVPSRTSL, (SEQ ID NO: 136) 
VRRAPPTTAVPSRTSLV, (SEQ ID NO: 137) 
VRRAPPTTAVPSRTSLVL, (SEQ ID NO: 138) 
VRRAPPTTAVPSRTSLVLT, (SEQ ID NO: 139) 
VRRAPPTTA VPSRTSLVLTL, (SEQ ID NO: 140) 
VRRAPPTTAVPSRTSLVLTLN, (SEQ ID NO: 141) 
VRRAPPTTAVPSRTSLVLTLNE, (SEQ ID NO: 142) 
VRRAPPTTAVPSRTSLVLTLNEL, (SEQ ID NO: 143) 
VRRAPPTTAVPSRTSLVLTLNELP, (SEQ ID NO- 144) 
aminc-.erminus amino acid residue extensions comprising one or more of the 
residues 176-332 of human ML as provided in Fig. 1 (SE Q | D N0 - 1) and 
pegylated forms thereof. ' 



-210- 



WO 95/18858 



PCT/US94/14553 



7. A fragment polypeptide according ,o Claim 6 selected from the group TPO<1- 
153) and TPO(1-245). 

5 8. A fragment polypeptide according to Claim 2. wherein the amino acid sequence 
of the fragment polypeptide comprises 
SPAPPACDLRVLSKLLRDSHVL, 
(SEQ ID NO: 110) 

HSRLSQCPEVHPLPTPVLLPAVDF. 
1 ° (SEQ ID NO: 111) 

S LG EWKTQM EETKAQDI LG AVTL, 
(SEQ ID NO: 112) 

LLEG VMAARG QLG PTCLSSLL, 
(SEQ ID NO: 113) 
1 5 GQLSGQVRLLLGALQS, 
(SEQ ID NO: 114) 

LLGTQLPPQGRTTAHKDPNAIF, 
(SEQ ID NO: 115) 

LSFQHLLRGKVRFLMLVGGSTLCVR, and 
2 0 (SEQ ID NO: 116) 

combinations thereof. 

9. The polypeptide of Claim 6 that is unglycosylated. 

25 10. An isolated polypeptide encoded by a nucleic acid having a sequence that 
hybridizes under moderately stringent conditions to the nucleic acid molecules 
having a nucleic acid sequence provided in Fig. 1 (SEQ ID NO: 2). 

11. The polypeptide of Claim 1 1 that is biologically active. ' 

12. The polypeptide of Claim 1 selected from the group hML, hMLi 53 
hML(R153A, R154A). hML2, hML3, hML4, mML, mML2. mML3, pML, and 
pML2. 



30 



35 13. 



A polypeptide according to Claim 2. wherein the amino acid sequence of the 
polypeptide comprises amino acid residues 1 to X of Fig. 1 (SEQ ID NO: 1) 
where X is selected from the group 153, 155. 164, 174, 191. 205 20?' 
217, 229, 245 and 332. ' 
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1 4. An isolated substantially homogeneous mpl ligand polypeptide sharing at least 
80% sequence identity with the polypeptide of Claim 13. 

5 15. The polypeptide of Claim 1 3 wherein X is 1 53. 

16. A chimera comprising the mpl ligand of Claim 13 fused to a heterologous 
polypeptide. 

10 17. The chimera of Claim 16 wherein the heterologous polypeptide is an 

immunoglobin polypeptide. 

18. The chimera of Claim 16 wherein the heterologous polypeptide is an 
interleukin polypeptide. 

1 5 

19. A chimera comprising the N-terminus residues 1 to about 153 to 157 of hML 
substituted with one or more, but not all. of the human EPO residues added or 
substituted into the N-terminus residues of hML at positions corresponding to 
the alignment shown in Fig. 10. 

20 

20. An antibody that is capable of binding the mpl ligand polypeptide of Claim 13. 

21 . A hybridoma cell line producing the antibody of Claim 20. 

25 22. An isolated nucleic acid molecule encoding the mpl iigand polypeptide of Cairn 

23. An isolated nucleic acid molecule encoding the mpl .igand polypeptide of Claim 
13. 

30 

24. An isolated nucleic acid molecule comprising the open reading frame nucleic 
acid sequence shown in Fig. 1 (SEQ ID NO: 2) 

25. The isolated nucleic acid molecule of Claim 24 encoding a mpl ligand 
35 polypeptide selected from the group hML. hML 153 , hML(R153A, R154A). 

HML2, HML3, hML4, mML, mML2. mML3. pML, and pML2. 
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26. An isolated nucleic acid molecule selected from the group consisting of 

( a ) a cDNA clone comprising the nucleotide sequence of the coding region of 
the mpl ligand gene; 

(b) a DNA sequence capable of hybridizing under stringent conditions to a 
5 clone of (a); and 

(c) a genetic variant of any of the DNA sequences of (a) and (b) which 
encodes a polypeptide possessing a biological property of a naturally occurring 
mpl ligand polypeptide. 

1 0 27. An isolated DNA molecule having a sequence capable of hybridizing to a DNA 
sequence provided in Fig. 1 (SEQ ID NO: 2) under moderately stringent 
conditions, wherein the DNA molecule encodes a biologically active mpl ligand 

polypeptide. 

1 5 28. The nucleic acid molecule of Claim 25 further comprising a promoter operably 

linked to the nucleic acid molecule. 

29. An expression vector comprising the nucleic acid sequence of Claim 25 
operably linked to control sequences recognized by a host cell transformed with 

2 0 the vector. 

30. A host cell transformed with the vector of Claim 29. 



25 



30 



35 



31. A process of using a nucleic acid molecule encoding the mpl ligand polypeptide 
to effect production of the mpl ligand polypeptide comprising culturing the host 
cell of Claim 30. 



32. The process of Claim 31 wherein the mpl ligand polypeptide Is recovered from 
the host cell. 



33. The process of Claim 31 wherein the mpl ligand polypeptide is recovered from 
the host cell culture medium. 



34. 



A method of determining the presence of mpl ligand polypeptide, comprising 
hybridizing DNA encoding the mpl ligand polypeptide to a test sample nucleic 
acid and determining the presence of mpl ligand polypeptide DNA. 
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35. 



36. 



37. 



1 0 

38. 



1 5 39. 



A method of amplifying a nucleic acid test sample comprising priming a nucleic 
ac,d polymerase reaction with nucieic acid encoding a mpl ligand polypeptide. 

A composition comprising the mp, lig and polypeptide of Claim 1 and a 
pharmaceutical^ acceptable carrier. 

A method for treating a mammal having or at risk for thrombocytopenia 
comprising administering to a mamma, in need of such treatment a 
therapeutically effective amount of the composition of Claim 36. 

The composition of Claim 36 further comprising a therapeutically effective 
amount of an agent selected from the group consisting of a cytokine, colony 

stimulating factor, and interleukin. 

The composition of Calm 38 wherein the agent is selected from KL, LIF G- 
CSF. GM-CSF, M-CSF, EPO. IL-1. IL-2. IL-3. IL-5, IL-6, .L-7. IL-8, '.L-9 
and IL-1 1 . 



40. 



20 



25 



A process for the biosynthesis of a human mp, ,i gand polypeptide 
characterized by the amino acid sequence of one human mp, ligand polypeptide' 
such as that disclosed in Fig. 1 (SEQ ID NO: 1), comprising: 

(a) growing a host cell culture containing a DNA isolate encoding the 
human mpl ligand polypeptide, 

(b) recovering the human mp/ ligand polypeptide from the culture and 

(c) purifying the mpl ligand polypeptide so as to obtain substantially 
homogeneous biologically active human mp/ligand polypeptide 
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